Snowflake SnowPro Advanced: Data Engineer (DEA-C02) - DEA-C02 Exam Practice Test
You are tasked with creating a SQL UDF in Snowflake to mask sensitive customer data (email addresses) before it's used in a reporting dashboard. The masking should replace all characters before the '@' symbol with asterisks, preserving the domain part. For example, '[email protected]' should become ' @example.com'. Which of the following SQL UDF definitions correctly implements this masking logic, while minimizing the impact on Snowflake compute resources?


Correct Answer: B
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have a base table 'ORDERS' with columns 'ORDER ID, 'CUSTOMER D', 'ORDER DATE, and 'ORDER AMOUNT'. You need to create a view that aggregates the total order amount per customer per month. However, for data governance purposes, you need to ensure that the view only shows data for the last 3 months. What is the MOST efficient and secure way to create this view in Snowflake?
Correct Answer: D
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
A data engineer is tasked with optimizing query performance on a Snowflake table named 'SALES DATA, which currently has no clustering key defined. The table contains 'SALE (unique identifier), 'SALE DATE, 'PRODUCT CATEGORY, and 'SALE AMOUNT. The business analysts frequently run queries filtering on 'SALE DATE and then aggregating by 'PRODUCT CATEGORY'. Choosing the right clustering keys for the SALES DATA table is crucial for minimizing disk 1/0 and enhancing query speed. Which of the following clustering key strategies would be MOST effective for the specified query patterns, considering both performance and the potential impact on data loading and DML operations?
Correct Answer: D
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
A data provider wants to share a large dataset (several TB) with multiple consumers. The dataset is updated daily. The provider wants to minimize the cost associated with data sharing and ensure that consumers receive consistent data'. Which of the following strategies would be the MOST cost-effective and maintainable?
Correct Answer: E
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You are using Snowpipe to load data from an AWS S3 bucket into Snowflake. The data files are compressed using GZIP and are being delivered frequently. You have observed that the pipe's backlog is increasing and data latency is becoming unacceptable. Which of the following actions could you take to improve Snowpipe's performance? (Select all that apply)
Correct Answer: A,C,D
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have a data pipeline that loads data from an internal stage into a Snowflake table Craw_data'). The pipeline is experiencing intermittent failures with the error 'SQL compilation error: Stage 'MY INTERNAL STAGE' is immutable'. What are the potential causes of this error and how would you troubleshoot it?
Correct Answer: A,C
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have a Snowflake table, 'CUSTOMER ORDERS', with columns like 'CUSTOMER ID', 'ORDER DATE', 'ORDER AMOUNT', and 'REGION'. A Bl dashboard relies on a query that aggregates data from this table, but the query performance is consistently slow. The query frequently filters by 'ORDER DATE and groups by 'REGION'. Based on the following 'EXPLAIN' output, which combination of techniques should be considered to improve the performance the most?


Correct Answer: A,C
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
A financial institution needs to implement both dynamic data masking and column-level security on the 'CUSTOMER DATA table, which contains sensitive information like 'CREDIT CARD NUMBER and 'SSN'. The requirement is: all users except those in the 'DATA ADMIN' role should see masked credit card numbers (last 4 digits unmasked) and masked SSNs. Users in 'DATA ADMIN' should see the original data'. Which of the following combination of policies and grants will achieve this?
Correct Answer: B
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
A financial institution is using Snowflake to store transaction data for millions of customers. The data is stored in a table named 'TRANSACTIONS with columns such as 'TRANSACTION ID, 'CUSTOMER ID', 'TRANSACTION DATE, 'TRANSACTION_AMOUNT, and 'MERCHANT CATEGORY'. Analysts are running complex analytical queries that often involve filtering transactions by 'TRANSACTION_DATE, 'MERCHANT CATEGORY' , and 'TRANSACTION_AMOUNT ranges. These queries are experiencing performance bottlenecks. The data team wants to leverage query acceleration service to improve performance without significantly altering the existing query patterns. Which of the following actions or combination of actions would be MOST beneficial, considering the constraints and the nature of the queries? (Select TWO)
Correct Answer: C,E
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You are responsible for monitoring the performance of a Snowflake data pipeline that loads data from S3 into a Snowflake table named 'SALES DATA. You notice that the COPY INTO command consistently takes longer than expected. You want to implement telemetry to proactively identify the root cause of the performance degradation. Which of the following methods, used together, provide the MOST comprehensive telemetry data for troubleshooting the COPY INTO performance?
Correct Answer: B,E
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You have a Snowflake table named 'ORDERS clustered on 'ORDER DATE. After a significant data load, you want to evaluate the effectiveness of the clustering. Which of the following SQL queries, using Snowflake system functions, will provide insights into the clustering depth and overlap of micro-partitions in the 'ORDERS' table, specifically helping you identify whether re-clustering is necessary? Assume that the table


Correct Answer: E
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You are loading data from an S3 bucket into a Snowflake table using the COPY INTO command. The source data contains dates in various formats (e.g., 'YYYY-MM-DD', 'MM/DD/YYYY', 'DD-Mon-YYYY'). You want to ensure that all dates are loaded correctly and consistently into a DATE column in Snowflake. Which of the following COPY INTO options and commands is the MOST appropriate to handle this?
Correct Answer: C
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You are tasked with ingesting data from an external stage into Snowflake. The data is in JSON format and compressed using GZIP. The JSON files contain nested arrays. You need to create a file format object that Snowflake can use to properly parse the dat a. Which of the following options represents the MOST efficient and correct file format definition to achieve this? Assume the stage is already created and accessible.


Correct Answer: A
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).
You are setting up a Kafka connector to load data from a Kafka topic into a Snowflake table. You want to use Snowflake's automatic schema evolution feature to handle potential schema changes in the Kafka topic. Which of the following is the correct approach to enable and configure automatic schema evolution using the Kafka Connector for Snowflake?
Correct Answer: C
Vote an answer
Explanation: Only visible for Fast2test members. You can sign-up / login (it's free).