What does the "SQL keyword" refer to in the context of adding filters to a worksheet?
The name of the function used to generate the filter
The filter name to be inserted into queries
The table name containing the static filter "key" values
The name of a User-Defined Function (UDF) used to define the filter "key" values
In Snowsight (Snowflake's web interface), Data Analysts can create interactive dashboards and worksheets using Filters. When you define a filter (such as a Date Range or a list of Regions), you must assign it a SQL Keyword.
This "SQL Keyword" acts as a variable placeholder within your SQL code. For example, if you create a filter for "Customer Region" and set the SQL keyword to :my_region, you can then write a query like: SELECT * FROM sales WHERE region = :my_region. When a user interacts with the UI and selects "North America" from the dropdown, Snowsight automatically injects "North America" into every instance where :my_region appears in the worksheet's SQL.
Evaluating the Options:
Option A is incorrect because the keyword is a label/variable, not the underlying function code.
Option C and Option D are incorrect as they confuse the data source of the filter values with the reference name used in the SQL code.
Option B is the correct answer. The SQL Keyword is specifically the identifier (prefixed with a colon in the code) that allows the analyst to link the UI element (the filter) to the execution logic of the query. This is a fundamental skill for the Data Presentation and Data Visualization domain, ensuring reports are dynamic and user-friendly.
Consider the following chart.

What can be said about the correlation for sales over time between the two categories?
There is a positive correlation.
There is a negative correlation.
There is no correlation.
There is a non-linear correlation.
In Data Analysis, correlation refers to a statistical relationship between two variables. When analyzing a time-series chart like the one provided, a Data Analyst looks for patterns in how the two categories—"Enterprise" (blue line) and "Pro Edition" (yellow line)—move in relation to one another over the X-axis (Year).
A Positive Correlation would be indicated if both lines generally moved in the same direction at the same time (e.g., when Enterprise sales increase, Pro Edition sales also increase). A Negative Correlation (or inverse correlation) would be shown if the lines moved in opposite directions consistently (e.g., when one peaks, the other troughs).
Looking closely at the provided exhibit, the fluctuations for both editions are highly erratic and appear independent of each other. For instance, around the year 2008, the Pro Edition (yellow) shows a significant peak while the Enterprise edition (blue) experiences a sharp decline. Conversely, in other sections of the chart, they both dip or rise simultaneously by chance, but there is no sustained, predictable pattern of movement. The peaks and valleys do not align in a way that suggests one variable's movement is tied to the other.
Statistically, this lack of a discernible relationship indicates a Correlation Coefficient near zero. In the context of the SnowPro Advanced: Data Analyst exam, identifying "No Correlation" is a key skill for interpreting Snowsight visualizations. It tells the analyst that the factors driving sales for the Enterprise tier are likely distinct from those driving the Pro Edition, and they should be analyzed as independent segments rather than interdependent variables. Therefore, based on the visual evidence of random, non-synchronous movement across the timeline, the only supported conclusion is that there is no correlation.
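For a numeric check of what the eye sees, Snowflake's CORR aggregate returns the Pearson correlation coefficient; a value near zero supports the "no correlation" reading. The sketch below is only illustrative and assumes a hypothetical sales_by_year table with one row per year and edition:

-- Pivot the two editions side by side, then compute the correlation coefficient.
SELECT CORR(enterprise_sales, pro_sales) AS sales_correlation
FROM (
    SELECT sales_year,
           MAX(IFF(edition = 'Enterprise', sales, NULL)) AS enterprise_sales,
           MAX(IFF(edition = 'Pro Edition', sales, NULL)) AS pro_sales
    FROM sales_by_year
    GROUP BY sales_year
);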
What functions should a Data Analyst use to run descriptive analytics on a data set? (Select TWO).
REGR_INTERCEPT
REGR_SLOPE
ROW_NUMBER
APPROX_COUNT_DISTINCT
AVG
Descriptive analytics is the process of using historical data to understand "what happened." This typically involves summarizing large datasets into interpretable chunks using central tendency, dispersion, and frequency measures.
AVG (Average) is a cornerstone of descriptive statistics. It provides the arithmetic mean of a numeric column, allowing an analyst to understand the "typical" value within a dataset (e.g., Average Order Value).
APPROX_COUNT_DISTINCT is a descriptive tool used to understand the volume of unique entities within a dataset (e.g., "How many unique customers visited the site?"). Like the HLL function, it is based on the HyperLogLog algorithm and provides a fast summary of data volume and variety, which is a primary goal of the descriptive phase of analysis.
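A minimal sketch combining both functions in one descriptive query, assuming a hypothetical orders table with order_total and customer_id columns:

-- Mean order value (central tendency) and an approximate count of unique customers (cardinality).
SELECT AVG(order_total) AS avg_order_value,
       APPROX_COUNT_DISTINCT(customer_id) AS approx_unique_customers
FROM orders;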
Evaluating the Options:
Options A and B (REGR_INTERCEPT and REGR_SLOPE) are used for linear regression. These fall under Predictive Analytics, as they are used to model relationships and predict future outcomes, rather than just describing current data.
Option C (ROW_NUMBER) is a window function used for data ranking and ordering, but it does not provide a descriptive summary of the dataset's characteristics.
Options D and E are correct because they provide summary statistics (mean and cardinality) that define the "state" of the data, which is the definition of descriptive analytics.
What scheme is used by Snowflake to estimate the approximate similarity between two or more data sets?
MINHASH
APPROX_PERCENTILE
HyperLogLog
APPROX_TOP_K
Snowflake provides several "approximate" functions designed to handle massive scale with high efficiency. While HyperLogLog (HLL) is the standard for estimating cardinality (unique counts), and APPROX_TOP_K is used for frequency estimation, the specific task of determining the similarity between two sets relies on a different probabilistic algorithm.
The MINHASH function is Snowflake's implementation for estimating the Jaccard similarity coefficient between two or more sets. Jaccard similarity is defined as the size of the intersection divided by the size of the union of the sample sets. Calculating an exact Jaccard similarity on billions of rows would be computationally expensive. MINHASH solves this by creating a "signature"—a small, fixed-size binary representation of the data. By comparing these signatures rather than the raw data, Snowflake can efficiently estimate how similar the original datasets are.
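A sketch of the usual pattern, assuming two hypothetical tables t1 and t2 that share a column named col; 100 is the number of hash functions used to build each signature:

-- Build a MinHash signature per table, then estimate the Jaccard similarity between them.
SELECT APPROXIMATE_SIMILARITY(mh) AS estimated_jaccard_similarity
FROM (
    SELECT MINHASH(100, col) AS mh FROM t1
    UNION ALL
    SELECT MINHASH(100, col) AS mh FROM t2
);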
Evaluating the Options:
Option B (APPROX_PERCENTILE) is used to estimate the value at a specific percentile (e.g., the 95th percentile of latency).
Option C (HyperLogLog) is used for estimating cardinality (the number of unique elements), not the similarity between sets.
Option D (APPROX_TOP_K) identifies the most frequent elements in a dataset.
Option A is correct. MINHASH is the specific function built into Snowflake for similarity estimation using the MinHash scheme.
A Data Analyst is working with a table that has 1 record per day, with sales information. Which window function would calculate a 7-day moving average of sales, where SALES_DATE represents the date column?
SUM(SALES) OVER (ORDER BY SALES_DATE ROWS BETWEEN 6 PRECEDING AND CURRENT ROW)
SUM(SALES) OVER (ORDER BY SALES_DATE ROWS BETWEEN 7 PRECEDING AND CURRENT ROW)
AVG(SALES) OVER (ORDER BY SALES_DATE ROWS BETWEEN 6 PRECEDING AND CURRENT ROW)
AVG(SALES) OVER (ORDER BY SALES_DATE ROWS BETWEEN 7 PRECEDING AND CURRENT ROW)
Calculating a moving average (or rolling average) is a standard time-series analysis technique used to smooth out short-term fluctuations and highlight longer-term trends. In Snowflake, this is accomplished using Window Functions and the ROWS framing clause.
To calculate a 7-day moving average when you have one record per day, the "window" or "frame" must encompass exactly 7 rows. In SQL windowing syntax, the CURRENT ROW counts as one of those days. Therefore, to reach a total of 7, you need to look back at the 6 preceding rows (6 + 1 = 7).
Evaluating the Options:
Options A and B use the SUM() function. While the sum is part of an average, the question specifically asks for the average itself.
Option D is incorrect because 7 PRECEDING AND CURRENT ROW actually creates an 8-day window (the current day plus the seven days before it).
Option C is correct. It uses the AVG() aggregate function and defines the frame as 6 PRECEDING AND CURRENT ROW, ensuring the calculation reflects exactly one week of data including the current day.
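Put into a complete statement (assuming a hypothetical DAILY_SALES table with one row per SALES_DATE), Option C looks like this:

-- 7-day moving average: the current row plus the 6 preceding rows.
SELECT sales_date,
       sales,
       AVG(sales) OVER (
           ORDER BY sales_date
           ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
       ) AS moving_avg_7d
FROM daily_sales
ORDER BY sales_date;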
A Data Analyst is working with three tables:

Which query would return a list of all brokers, a count of the customers each broker has, and the total order amount of their customers (as shown below)?

A)

B)

C)

D)

Option A
Option B
Option C
Option D
To achieve the desired result, an analyst must understand the fundamental behavior of different JOIN types within Snowflake and how they affect the retention of records from the "left" or primary table. The goal here is to list all brokers, even those who have zero customers (like "Drew") or customers with zero orders (like "Debby").
In SQL, an INNER JOIN only returns rows when there is a match in both tables. If we were to use an INNER JOIN between BROKER and CUSTOMER, Drew would be excluded from the results because he has no associated records in the CUSTOMER table. Similarly, an INNER JOIN with the ORDERS table would exclude any broker whose customers haven't placed an order.
Evaluating the Join Logic:
Option C is the correct solution because it utilizes a chain of LEFT JOINs. A LEFT JOIN (or LEFT OUTER JOIN) ensures that every record from the left table (BROKER) is preserved in the result set. If no matching record exists in the joined table (CUSTOMER or ORDERS), Snowflake populates the columns with NULL. This is why "Drew" appears with a CUST_COUNT of 0 and "Debby" appears with a NULL for the total order amount.
Option A fails because it uses an INNER JOIN for the CUSTOMER table, which would immediately filter out "Drew."
Option B and Option D fail because they use INNER JOINs at different stages of the query, which would strip away brokers or customers that do not have matching order activity.
Additionally, the query correctly uses COUNT(DISTINCT c.customer_id) to ensure that customers are not double-counted if they have multiple orders, and GROUP BY 1 (referencing b.broker_name) to aggregate the data at the broker level. This pattern is essential for accurate Data Analysis in Snowflake when dealing with "optional" relationships in a star or snowflake schema.
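Because the exhibit is not reproduced here, the following is only a hedged reconstruction of the shape of Option C; the table and column names (BROKER, CUSTOMER, ORDERS, broker_id, order_amt) are assumptions, while COUNT(DISTINCT c.customer_id) and GROUP BY 1 come from the explanation above:

-- LEFT JOINs preserve every broker, even those with no customers or no orders.
SELECT b.broker_name,
       COUNT(DISTINCT c.customer_id) AS cust_count,
       SUM(o.order_amt) AS total_order_amount
FROM broker b
LEFT JOIN customer c ON c.broker_id = b.broker_id
LEFT JOIN orders o ON o.customer_id = c.customer_id
GROUP BY 1;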
A Data Analyst has a very large table with columns that contain country and city names. Which query will provide a very quick estimate of the total number of different values of these two columns?
SELECT DISTINCT COUNT(country, city) FROM TABLE1;
SELECT HLL(country, city) FROM TABLE1;
SELECT COUNT(DISTINCT country, city) FROM TABLE1;
SELECT COUNT(country, city) FROM TABLE1;
When working with very large tables, calculating the exact number of unique combinations of columns (cardinality) using COUNT(DISTINCT ...) is a resource-intensive operation. It requires the Snowflake query engine to keep an exhaustive list of every unique pair encountered in memory, which can lead to high credit consumption and performance bottlenecks.
To provide a "very quick estimate," Snowflake utilizes the HyperLogLog (HLL) algorithm. The function HLL(column1, column2, ...) returns an HLL state (a binary representation) that can be used to estimate the number of distinct values with a high degree of accuracy and minimal computational overhead. This is part of Snowflake's suite of approximate aggregation functions, which are essential for Data Analysis on massive datasets where a 1% margin of error is acceptable in exchange for significantly faster results.
Evaluating the Options:
Option A applies DISTINCT to the single-row result of COUNT rather than to the column values, so it simply returns the total number of rows where both columns are non-null, not the number of distinct pairs.
Option C is a valid query but will be significantly slower and more expensive than an HLL estimate on a "very large table".
Option D simply counts the total number of non-null rows, which does not represent the "number of different values" (cardinality).
Option B is correct. It addresses the requirement for a "quick estimate" using the probabilistic counting method built into Snowflake.
Which query will provide this data without incurring additional storage costs?
CREATE TABLE DEV.PUBLIC.TRANS_HIST LIKE PROD.PUBLIC.TRANS_HIST;
CREATE TABLE DEV.PUBLIC.TRANS_HIST AS (SELECT * FROM PROD.PUBLIC.TRANS_HIST);
CREATE TABLE DEV.PUBLIC.TRANS_HIST CLONE PROD.PUBLIC.TRANS_HIST;
CREATE TABLE DEV.PUBLIC.TRANS_HIST AS (SELECT * FROM PROD.PUBLIC.TRANS_HIST WHERE extract(year from (TRANS_DATE)) = 2019);
Snowflake utilizes a unique architecture known as Zero-Copy Cloning, which allows users to create a replica of a table, schema, or entire database without physically duplicating the underlying data files (micro-partitions). When you execute the CLONE command, Snowflake simply creates new metadata that points to the existing micro-partitions of the source object.
Because the data is not physically copied, the clone operation is nearly instantaneous and, crucially, incurs no additional storage costs at the moment of creation. Storage costs only begin to accumulate for the clone when the data in the source or the clone diverges—for example, if rows are updated or deleted in the clone, Snowflake creates new micro-partitions to store the changed data while preserving the original state in the source.
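A short sketch of the lifecycle: the clone itself is free to create, and storage starts accruing only when the data diverges (the UPDATE below, including the trans_status column, is purely hypothetical):

-- Metadata-only copy: no micro-partitions are duplicated, so no storage cost at creation.
CREATE TABLE DEV.PUBLIC.TRANS_HIST CLONE PROD.PUBLIC.TRANS_HIST;

-- Storage begins to diverge only once the clone (or the source) is modified.
UPDATE DEV.PUBLIC.TRANS_HIST
SET trans_status = 'TEST'   -- hypothetical column, for illustration only
WHERE extract(year from TRANS_DATE) = 2019;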
Evaluating the Options:
Option A (LIKE) only copies the column definitions and structure of the table. It does not copy the data itself, so while it doesn't incur storage, it also doesn't provide the "data" requested by the prompt.
Option B (CTAS - Create Table As Select) performs a full deep copy of the data. This creates entirely new micro-partitions, which immediately increases the storage footprint and associated costs.
Option D is a filtered CTAS operation. While it may result in less data than the full table, it still involves creating new physical storage for the 2019 records, thus incurring additional costs.
Option C is correct. It uses the CLONE keyword, which is the Snowflake feature designed to provide a full dataset for dev/test environments with zero initial storage impact. This is a core competency in the Data Transformation and Data Modeling domain.
Why would a Data Analyst use a dimensional model rather than a single flat table to meet BI requirements for a virtual warehouse? (Select TWO).
Dimensional modelling will improve query performance over a single table.
Dimensional modelling will save on storage space since it is denormalized.
Combining facts and dimensions in a single flat table limits the scalability and flexibility.
Dimensions and facts allow power users to run ad-hoc analyses.
Snowflake generally performs better with dimensional modelling.
In the field of data warehousing and business intelligence (BI), choosing the right data model is crucial for long-term maintainability and user accessibility. While a single flat table might seem simple initially, dimensional modeling (typically using Star or Snowflake schemas) provides distinct advantages for enterprise analytics.
1. Scalability and Flexibility (Option C)
Combining all attributes into a single flat table creates a highly rigid structure. Every time a new attribute is added to a dimension (e.g., adding a "Promotion Category" to a product), the entire flat table must be rewritten or altered, which is inefficient for large datasets. Furthermore, flat tables often contain redundant data, leading to "update anomalies" where a change in a dimension attribute must be propagated across millions of rows. A dimensional model separates changing business processes (Facts) from the context of those processes (Dimensions), allowing the schema to scale and evolve independently.
2. Ad-hoc Analysis for Power Users (Option D)
Dimensional models are specifically designed to be intuitive for business users and BI tools. By organizing data into Facts (measurable metrics) and Dimensions (descriptive attributes), power users can easily "slice and dice" data across different hierarchies. For example, a user can quickly run an ad-hoc query to compare "Total Sales" (Fact) by "Store Region" (Dimension) and "Calendar Month" (Dimension). This structure provides a predictable and standardized "language" for the data, making it easier for users to build their own reports without needing a Data Analyst to create a custom flat table for every specific request.
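A hedged sketch of such an ad-hoc query against a star schema; the FACT_SALES, DIM_STORE, and DIM_DATE tables and their key columns are hypothetical:

-- Total sales (fact) sliced by store region and calendar month (dimensions).
SELECT d.calendar_month,
       s.store_region,
       SUM(f.sales_amount) AS total_sales
FROM fact_sales f
JOIN dim_store s ON f.store_key = s.store_key
JOIN dim_date d ON f.date_key = d.date_key
GROUP BY d.calendar_month, s.store_region;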
Evaluating the Distractors:
Options A and E: These are common misconceptions. Modern cloud data warehouses like Snowflake are often highly optimized for wide "flat" tables due to columnar storage and sophisticated pruning. In many cases, a flat table may actually outperform a multi-table join (dimensional model) because it avoids the computational overhead of the join itself.
Option B: This is factually incorrect. Flat tables are denormalized (repeating data), which generally takes more storage space. Dimensional modeling is a form of normalization that saves space by storing descriptive strings once in a dimension table rather than repeating them for every transaction in a fact table.
When building a Snowsight dashboard that will allow users to filter data within a worksheet, which Snowflake system filters should be used?
Include the :datebucket system filter in a WHERE clause, and include the :daterange system filter in a GROUP BY clause.
Include the :daterange system filter in a SELECT clause, and include the :datebucket system filter in a GROUP BY clause.
Include the :datebucket system filter in a WHERE clause, and include the :daterange system filter in a SELECT clause.
Include the :daterange system filter in a WHERE clause, and include the :datebucket system filter in a GROUP BY clause.
Snowsight provides special System Keywords that allow analysts to create dynamic, interactive dashboards without hard-coding dates. Two of the most critical keywords are :daterange and :datebucket.
The :daterange keyword is designed to filter the volume of data based on a time period selected by the user in the dashboard UI (e.g., "Last 30 days" or "Current Year"). Because it acts as a filter on the underlying data, it must be placed in the WHERE clause of the SQL statement (e.g., WHERE created_at = :daterange). This ensures that only the records within the selected timeframe are processed by the query.
The :datebucket keyword is used for time-series aggregation. It allows the user to dynamically change the granularity of the data—for example, switching a chart from "Daily" to "Monthly" views without rewriting the query. To achieve this, :datebucket is used inside a date-truncation function in the SELECT list and, crucially, must be included in the GROUP BY clause to correctly aggregate the metrics (e.g., GROUP BY 1 or GROUP BY :datebucket).
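A sketch of this pattern, assuming a hypothetical orders table; the keyword placement follows the description above, but the exact syntax should be confirmed against the current Snowsight documentation:

-- :daterange filters the rows; :datebucket controls the aggregation grain.
SELECT DATE_TRUNC(:datebucket, order_date) AS period,
       SUM(order_total) AS total_sales
FROM orders
WHERE order_date = :daterange
GROUP BY 1
ORDER BY 1;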
Evaluating the Options:
Option A is incorrect because :datebucket is for grouping/truncation, not for filtering in a WHERE clause.
Option B is incorrect because :daterange is a filter (returning a range), not a scalar value suitable for a SELECT list.
Option C is incorrect for the same reasons as A and B.
Option D is correct. It follows the standard Snowsight design pattern: :daterange restricts the data rows in the WHERE clause, while :datebucket defines the temporal aggregation level in the GROUP BY clause.
What is a benefit of using SQL queries that contain secure views?
Users will not be able to make observations about the quantity of underlying data.
The amount of data scanned, and the total data volume are obfuscated.
Only the number of scanned micro-partitions is exposed, not the number of bytes scanned.
Snowflake secure views are more performant than regular views.
Secure Views in Snowflake are specifically designed for data privacy and security, particularly when data is being shared across different business units or external accounts. The primary benefit of a secure view is that it prevents users from seeing the internal definitions (the underlying SQL) and protects against "trial-and-error" data discovery.
In a standard view, a savvy user might be able to deduce information about data they are not authorized to see by observing the query optimizer's behavior or by looking at the query profile. For example, by using specific filters or functions, a user might observe how long a query takes to execute or check the statistics to guess the distribution of values in hidden columns. Secure views prevent this by ensuring that Snowflake does not expose the internal metadata or the specific row/column counts that would allow a user to make observations about the quantity or nature of the underlying data that falls outside their access privileges.
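A minimal sketch of creating one; the sales table and the entitlement subquery are hypothetical, and the SECURE keyword is what hides the view definition and disables the risky optimizations:

-- The view text is hidden from non-owner roles, and the optimizer will not apply
-- shortcuts that could leak information about rows outside the consumer's entitlement.
CREATE OR REPLACE SECURE VIEW shared_sales_v AS
SELECT region, order_date, order_total
FROM sales
WHERE region IN (SELECT region
                 FROM region_entitlements       -- hypothetical mapping table
                 WHERE role_name = CURRENT_ROLE());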
Evaluating the Options:
Options B and C are incorrect because Snowflake does not merely obfuscate the bytes or micro-partitions scanned; secure views fundamentally change how the query plan is generated to ensure security.
Option D is a common misconception. In fact, secure views can sometimes be less performant than regular views. This is because the Snowflake optimizer is restricted from using certain optimizations (like predicate pushdown) that might inadvertently reveal data patterns to an unauthorized user.
Option A is correct. The "secure" nature of the view ensures that the user cannot use statistical side channels or metadata observations to infer the existence or volume of data they are restricted from seeing.
Which Snowflake feature or object can be used to dynamically create and execute SQL statements?
User-Defined Functions (UDFs)
System-defined functions
Stored procedures
Tasks
While Snowflake provides various ways to encapsulate logic, Stored Procedures are the primary tool for procedural programming and dynamic SQL execution. Unlike standard UDFs, which are generally designed to calculate and return values within a query, stored procedures can execute administrative commands (DDL) and data manipulation (DML) that are not possible in a simple SELECT statement.
Using Snowflake Scripting (or languages like Python, Java, or JavaScript), an analyst can write a procedure that builds a SQL string as a variable and then executes it using the EXECUTE IMMEDIATE command. This lets input parameters determine at runtime exactly which statement is constructed and executed. While Tasks can be used to schedule the execution of these procedures, the task itself is a scheduling agent and does not provide the logic-driven dynamic string construction inherent to the procedure.
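A hedged Snowflake Scripting sketch of the pattern: the procedure assembles a SQL string from its input and runs it with EXECUTE IMMEDIATE (the procedure name, table suffix, and parameter are hypothetical):

CREATE OR REPLACE PROCEDURE archive_table(src_table STRING)
RETURNS STRING
LANGUAGE SQL
AS
$$
DECLARE
    sql_text STRING;
BEGIN
    -- Build the statement dynamically from the input parameter.
    sql_text := 'CREATE OR REPLACE TABLE ' || src_table || '_ARCHIVE AS SELECT * FROM ' || src_table;
    EXECUTE IMMEDIATE :sql_text;
    RETURN 'Executed: ' || sql_text;
END;
$$;

Calling CALL archive_table('SALES_2019'); would then construct and run the CTAS statement for that specific table.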
A company is looking for new headquarters and wants to minimize the distances employees have to commute. The company has geographic data on employees' residences. Through the Snowflake Marketplace, the company obtained geographic data for possible locations of the new headquarters. How can the distance between an employee's residence and potential headquarters locations be calculated in meters with the LEAST operational overhead?
ST_HAUSDORFFDISTANCE
HAVERSINE
ST_LENGTH
ST_DISTANCE
Snowflake provides native support for geospatial analysis through the GEOGRAPHY and GEOMETRY data types, along with a suite of Standardized Spatial Functions. To calculate the "least distance" between two geographic points (such as an employee's home and a potential office site) on the Earth's surface, the most efficient and direct function is ST_DISTANCE.
The ST_DISTANCE function takes two GEOGRAPHY objects as input and returns the minimum geodesic distance between them. A key benefit of using this native function for Data Analysis is that it automatically returns the result in meters by default when used with the GEOGRAPHY type, which models the Earth as a spheroid. This eliminates the need for manual mathematical conversions or complex custom logic, satisfying the "least operational overhead" requirement.
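A minimal sketch with two hypothetical longitude/latitude points; because the inputs are GEOGRAPHY objects, the result is already in meters:

-- Geodesic distance in meters between an employee residence and a candidate site.
SELECT ST_DISTANCE(
           TO_GEOGRAPHY('POINT(-73.9857 40.7484)'),   -- hypothetical employee residence
           TO_GEOGRAPHY('POINT(-74.0445 40.6892)')    -- hypothetical headquarters location
       ) AS distance_meters;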
Evaluating the Options:
Option A (ST_HAUSDORFFDISTANCE) is used to measure the similarity between two shapes (geometries), not the simple distance between two points.
Option B (HAVERSINE) is a mathematical formula that can be implemented manually in SQL, but it requires significantly more code and "operational overhead" compared to a single built-in function.
Option C (ST_LENGTH) is used to measure the total length of a LineString or the perimeter of a Polygon, rather than the distance between two distinct objects.
Option D is correct. ST_DISTANCE is the optimized, native Snowflake function for point-to-point distance calculations in the Data Cloud.
A Data Analyst wants to transform query results. Which transformation option will incur compute costs?
Showing a thousand separator for numeric columns.
Sorting a column by using the column options.
Increasing or decreasing decimal precision.
Formatting date and timestamp columns.
In the Snowflake Snowsight interface, it is critical to distinguish between UI-level formatting and engine-level processing. Snowsight provides several client-side features that allow an analyst to change how data is displayed without re-executing the underlying SQL query or utilizing virtual warehouse credits.
Client-Side (No Compute Cost):
Formatting options such as adding thousand separators (Option A), adjusting the visible decimal precision (Option C), or changing the display format of dates and timestamps (Option D) are typically handled by the Snowsight web interface itself. These transformations are applied to the data that has already been retrieved into the browser's local result cache. Because they do not require the virtual warehouse to scan micro-partitions or perform new calculations, they do not incur additional compute costs.
Engine-Level (Incurs Compute Cost):
Sorting a column (Option B) is fundamentally different. While Snowsight allows you to click a column header to sort, this action frequently triggers a re-query or a secondary processing step if the entire result set is not already fully cached in the browser's memory. When you use "column options" to perform operations like sorting, filtering, or grouping on large datasets, Snowflake often has to leverage the virtual warehouse to reorganize the data. In the context of the Snowflake Data Analyst exam, sorting is identified as a transformation that requires active compute resources because the engine must evaluate the entire dataset to determine the new order of records.
Furthermore, even if a small result set is cached, complex sorting across large volumes of data necessitates warehouse involvement to ensure accuracy and handle "spilling" to local or remote storage if the sort operation exceeds available memory. Therefore, while visual "masks" are free, structural data reorganization like sorting is a compute-intensive task.
A Data Analyst has a Parquet file stored in an Amazon S3 staging area. Which query will copy the data from the staged Parquet file into separate columns in the target table?

Option A
Option B
Option C
Option D
In the Snowflake ecosystem, Parquet is treated as a semi-structured data format. When you stage a Parquet file, Snowflake does not automatically parse it into multiple columns like it might with a flat CSV file. Instead, the entire content of a single row or record is loaded into a single VARIANT column, which is referenced in SQL using the positional notation $1.
The fundamental mistake often made—and represented in Option A—is treating Parquet as a delimited format where $1, $2, and $3 refer to different columns. In Parquet ingestion, columns $2 and beyond will return NULL because the schema is contained within the object in $1.
To successfully "shred" or flatten this semi-structured data into a relational table with separate columns, an analyst must use path notation. This involves referencing the root object ($1), followed by a colon (:), and then the specific element key (e.g., $1:o_custkey). Furthermore, because the values extracted from a Variant are technically still Variants, they must be explicitly cast to the correct data type using the double-colon syntax (e.g., ::number, ::date) to ensure they land in the target table with the correct data types.
Evaluating the Options:
Option A is incorrect because it uses positional references ($2, $3, etc.) which are only valid for structured files like CSVs.
Option B is incorrect because it attempts to reference keys directly without the required stage variable ($1) and colon separator.
Option D is incorrect as it uses a non-standard parse() function that does not exist for this purpose in Snowflake SQL.
Option C uses the correct syntax. It identifies that the Parquet data resides in $1, uses the colon to access internal keys, and applies the necessary type casting. This method is known as "transformation during ingestion" and is a core competency for any SnowPro Advanced Data Analyst.