Implementing Analytics Solutions Using Microsoft Fabric 온라인 연습
최종 업데이트 시간: 2026년06월04일
당신은 온라인 연습 문제를 통해 Microsoft DP-600 시험지식에 대해 자신이 어떻게 알고 있는지 파악한 후 시험 참가 신청 여부를 결정할 수 있다.
시험을 100% 합격하고 시험 준비 시간을 35% 절약하기를 바라며 DP-600 덤프 (최신 실제 시험 문제)를 사용 선택하여 현재 최신 55개의 시험 문제와 답을 포함하십시오.


정답: 
Explanation:
Box 1: Type 2
There are 6 types of Slowly Changing Dimension that are commonly used, they are as follows:
Type 0 C Fixed Dimension
No changes allowed, dimension never changes
Type 1 C No History
Update record directly, there is no record of historical values, only current state
Type 2 C Row Versioning
Track changes as version records with current flag & active dates and other metadata
Type 3 C Previous Value column
Track change to a specific attribute, add a column to show the previous value, which is updated as further changes occur
Etc.
Box 2: Type 1
Reference: https://adatis.co.uk/introduction-to-slowly-changing-dimensions-scd-types/
정답:
Explanation:
The FULL OUTER JOIN keyword returns all records when there is a match in left (table1) or right (table2) table records.

Reference: https://www.w3schools.com/sql/sql_join_full.asp

정답: 
Explanation:
Box 1: append
Mode "append" atomically adds new data to an existing Delta table and "overwrite" atomically replaces all of the data in a table.
Box 2: overwriteSchema false
Explicitly update schema to change column type or name
You can change a column’s type or name or drop a column by rewriting the table. To do this, use the overwriteSchema option.
The following example shows changing a column type:
(spark.read.table(...)
.withColumn("birthDate", col("birthDate").cast("date"))
.write
.mode("overwrite")
.option("overwriteSchema", "true")
.saveAsTable(...)
), when performing an Overwrite, the data will be deleted before writing out the new data.
Reference: https://learn.microsoft.com/en-us/azure/databricks/delta/update-schema
정답:
Explanation:
Power Query activity in Azure Data Factory
The Power Query activity allows you to build and execute Power Query mash-ups to execute data wrangling at scale in a Data Factory pipeline. You can create a new Power Query mash-up from the New resources menu option or by adding a Power Activity to your pipeline.
Translation to data flow script
To achieve scale with your Power Query activity, Azure Data Factory translates your M script into a data flow script so that you can execute your Power Query at scale using the Azure Data Factory data flow Spark environment.
Example:

Reference: https://learn.microsoft.com/en-us/azure/data-factory/control-flow-power-query-activity

정답: 
Explanation:
Box 1: delta
Use a notebook to load data into your Lakehouse
Saving data in the Lakehouse using capabilities such as Load to Tables or methods described in Options to get data into the Fabric Lakehouse, all data is saved in Delta format.
# Keep it if you want to save dataframe as a delta lake, parquet table to Tables section of the default Lakehouse
df.write.mode("overwrite").format("delta").saveAsTable(delta_table_name)
# Keep it if you want to save the dataframe as a delta lake, appending the data to an existing table df.write.mode("append").format("delta").saveAsTable(delta_table_name)
QUESTION NO: NO: The solution must ensure that the content will display automatically as a table named Sales in Lakehouse explorer.
Box 2: files/sales
Reference:
https://learn.microsoft.com/en-us/fabric/data-engineering/lakehouse-notebook-load-data
https://learn.microsoft.com/en-us/fabric/data-engineering/lakehouse-and-delta-tables

정답: 
Explanation:
Box 1: withColumn
In PySpark, we can use the cast method to change the data type.
from pyspark.sql.types import IntegerType
from pyspark.sql import functions as F
# first method
df = df.withColumn("Age", df.age.cast("int"))
# second method
df = df.withColumn("Age", df.age.cast(IntegerType()))
# third method <-- This one
df = df.withColumn("Age", F.col("Age").cast(IntegerType()))
Box 2: col
Box 3: cast
Reference: https://www.aporia.com/resources/how-to/change-column-data-types-in-dataframe/


정답: 
Explanation:
Box 1: A schedule
Orchestration pipeline
Run and schedule the data pipeline
Box 2: A pipeline Copy activity
Bronze layer, pipelines with Copy activities (Lakehouse)
Configure Lakehouse in a copy activity
Use the copy activity in a data pipeline to copy data from and to the Fabric Lakehouse.
Box 3: A pipeline Dataflow activity
Silver layer, dataflows (Lakehouse)
Microsoft Fabric, Data Factory, Use a dataflow in a pipeline
A dataflow is a reusable data transformation that can be used in a pipeline.
Box 4: A pipeline Stored procedure actvitity
Gold layer, stored procedures (warehouse)
Azure Data Factory, Transform data by using the SQL Server Stored Procedure activity in Azure Data Factory or Synapse Analytics
Reference:
https://learn.microsoft.com/en-us/fabric/data-factory/connector-lakehouse-copy-activity
https://learn.microsoft.com/en-us/fabric/data-factory/tutorial-dataflows-gen2-pipeline-activity
https://learn.microsoft.com/en-us/azure/data-factory/transform-data-using-stored-procedure


정답: 
Explanation:
Box 1: ReadAll
User1, Read all the Spark data
If the “Read all Apache Spark” box is checked, users will be given ReadAll. This permission allows users to access data in OneLake. This could be through direct OneLake access, Apache Spark queries, or the lakehouse UX.
Box 2: ReadData
User2, Read all the SQL endpoint data
If the “Read all SQL endpoint data” is checked, users will be given the ReadData permission. ReadData gives access to all Tables in the item when accessing through the SQL Endpoint. Users will not be able to access OneLake directly.
Reference: https://support.fabric.microsoft.com/en-us/blog/building-common-data-architectures-with-onelake-in-microsoft-fabric


정답: 
Explanation:
Box 1: df.withColumnRenamed
Add a column named pickupDate that will contain only the date portion of pickupDateTime.
withColumnRenamed(existing, new)[source]
Returns a new DataFrame by renaming an existing column. This is a no-op if schema doesn’t contain the given column name.
Parameters:
existing C string, name of the existing column to rename.
col C string, new name of the column.
>>> df.withColumnRenamed('age', 'age2').collect() [Row(age2=2, name='Alice'), Row(age2=5, name='Bob')]
Incorrect:
* df.withColumn withColumn(colName, col)[source]
Returns a new DataFrame by adding a column or replacing the existing column that has the same name.
The column expression must be an expression over this DataFrame; attempting to add a column from some other dataframe will raise an error.
Parameters:
colName C string, name of the new column.
col C a Column expression for the new column.
>>> df.withColumn('age2', df.age + 2).collect()
[Row(age=2, name='Alice', age2=4), Row(age=5, name='Bob', age2=7)]
Box 2: cast('date')
cast(dataType)[source]
Convert the column into type dataType.
>>> df.select(df.age.cast("string").alias('ages')).collect() [Row(ages='2'), Row(ages='5')]
>>> df.select(df.age.cast(StringType()).alias('ages')).collect() [Row(ages='2'), Row(ages='5')]
Box 3: .filter("fareAmount > 0 AND fareAmount < 100"
Filter the DataFrame to include only rows where fareAmount is a positive number that is less than 100.
filter(condition)[source]
Filters rows using the given condition.
where() is an alias for filter().
>>> df.filter(df.age > 3).collect() [Row(age=5, name='Bob')]
>>> df.where(df.age == 2).collect() [Row(age=2, name='Alice')]
>>> df.filter("age > 3").collect() [Row(age=5, name='Bob')]
>>> df.where("age = 2").collect() [Row(age=2, name='Alice')]
Incorrect:
*.where
Isin will not give the desired result.
Note: In Apache Spark, the where() function can be used to filter rows in a DataFrame based on a given condition. The condition is specified as a string that is evaluated for each row in the DataFrame. Rows for which the condition evaluates to True are retained, while those for which it evaluates to False are removed.
isin(*cols)[source]
A boolean expression that is evaluated to true if the value of this expression is contained by the evaluated values of the arguments.
>>> df[df.name.isin("Bob", "Mike")].collect() [Row(age=5, name='Bob')]
>>> df[df.age.isin([1, 2, 3])].collect() [Row(age=2, name='Alice')]
Reference: https://spark.apache.org/docs/2.3.0/api/python/pyspark.sql.html

정답: 
Explanation:
Box 1: Value
Query folding on native queries
Use Value.NativeQuery function
The goal of this process is to execute the following SQL code, and to apply more transformations with
Power Query that can be folded back to the source.
SELECT DepartmentID, Name FROM HumanResources.Department WHERE GroupName = 'Research and Development'
The first step was to define the correct target, which in this case is the database where the SQL code will be run. Once a step has the correct target, you can select that step―in this case, Source in Applied Steps ―and then select the fx button in the formula bar to add a custom step. In this example, replace the Source formula with the following formula:
Value.NativeQuery(Source, "SELECT DepartmentID, Name FROM HumanResources.Department WHERE GroupName = 'Research and Development'
Box 2: NativeQuery
Box 3: EnableFolding
The most important component of this formula is the use of the optional record for the forth parameter of the function that has the EnableFolding record field set to true.

Reference: https://learn.microsoft.com/en-us/power-query/native-query-folding
정답:

정답: 
Explanation:
Box 1: Yes
PartitionBy segregates data into folders.
Note: PySpark partitionBy() is a function of pyspark.sql.DataFrameWriter class which is used to partition the large dataset (DataFrame) into smaller files based on one or multiple columns while writing to disk-
Box 2: Yes
Box 3: No
Reference: https://sparkbyexamples.com/pyspark/pyspark-partitionby-example/
정답:
Explanation:
Delta Lake table format interoperability
In Microsoft Fabric, the Delta Lake table format is the standard for analytics. Delta Lake is an open-source storage layer that brings ACID (Atomicity, Consistency, Isolation, Durability) transactions to big data and analytics workloads.
All Fabric experiences generate and consume Delta Lake tables, driving interoperability and a unified product experience. Delta Lake tables produced by one compute engine, such as *Synapse Data warehouse* or Synapse Spark, can be consumed by any other engine, such as Power BI. When you ingest data into Fabric, Fabric stores it as Delta tables by default. You can easily integrate external data containing Delta Lake tables by using OneLake shortcuts.
The following matrix shows key Delta Lake features and their support on each Fabric capability.

Etc.
Reference: https://learn.microsoft.com/en-us/fabric/get-started/delta-lake-interoperability
정답:
Explanation:
The following properties are supported for Lakehouse under the Destination tab of a copy activity.
* Under Advanced, you can specify the following fields:
- Table actions: Specify the operation against the selected table.
-- Overwrite: Overwrite the existing data and schema in the table using the new values. If this operation is selected, you can enable partition on your target table:
--- Enable Partition: This selection allows you to create partitions in a folder structure based on one or multiple columns. Each distinct column value (pair) is a new partition. For example, "year=2000/month=01/ file". This selection supports insert-only mode and requires an empty directory in the destination.
----Partition column name: Select from the destination columns in schemas mapping. Supported data types are string, integer, boolean, and datetime. Format respects type conversion settings under the Mapping tab.
Incorrect:
Not A:
* Append: Append new values to existing table.
* Etc.
Not C: The following tables contain more information about a copy activity in Lakehouse.
Source information
* Enable partition discovery
Whether to parse the partitions from the file path and add them as extra source columns.
* Etc.
Reference: https://learn.microsoft.com/en-us/fabric/data-factory/connector-lakehouse-copy-activity
정답:
Explanation:
Load to Delta Lake table
The Lakehouse in Microsoft Fabric provides a feature to efficiently load common file types to an optimized Delta table ready for analytics. The Load to Table feature allows users to load a single file or a folder of files to a table. This feature increases productivity for data engineers by allowing them to quickly use a right-click action to enable table loading on files and folders. Loading to the table is also a no-code experience, which lowers the entry bar for all personas.
Reference: https://learn.microsoft.com/en-us/fabric/data-engineering/load-to-tables