Master the DP-600

An interactive study guide built on 7 memory techniques to help you pass the Microsoft Fabric Analytics Engineer Associate exam.

Plan 10-15% | Prepare 40-45% | Semantic 20-25% | Explore 20-25%

What it covers

Semantic models, DAX, Direct Lake, query folding, security (RLS/OLS/CLS), deployment pipelines, and Power BI report optimisation within Microsoft Fabric.

Ideal for

BI developers, Power BI report authors, data analysts building semantic models, and analytics engineers working in Microsoft Fabric.

Aspire to this if

You're a data analyst wanting to move into analytics engineering, or a SQL developer looking to master DAX and the modern Microsoft BI stack.

Section 1 / Spatial Memory

The Map

An architecture diagram of Microsoft Fabric, organised by the four exam domains and their weightings.

Plan & Manage: 10-15%
Prepare & Serve Data: 40-45%
Semantic Models: 20-25%
Explore & Analyse: 20-25%

Section 2 / Narrative Memory

The Story

Follow data on its journey through Microsoft Fabric — from raw arrival to polished insight.

📦

The Arrival

Raw data arrives at the grand harbour of OneLake, the one lake to rule them all. Every piece of data in Microsoft Fabric automatically docks here — no exceptions. OneLake is built atop Azure Data Lake Storage Gen2, providing a single unified namespace for the entire organisation.

Exam Intel: OneLake = OneDrive for data. One logical lake per tenant. Built on ADLS Gen2. All Fabric items store data in OneLake automatically.

🏗️

The Factory Floor

Inside the Data Factory, pipelines hum with 170+ connectors pulling data from every corner of the enterprise. The Copy Activity moves raw materials from source to destination, while orchestration pipelines coordinate the entire operation.

Exam Intel: Data Factory = orchestration + data movement. Copy Activity for transfers. Supports full and incremental loading patterns. Pipeline templates available.

🧹

The Cleaning Crew

Dataflows Gen2 workers scrub, transform, and polish the data using Power Query Online. They watch the folding indicators carefully — 5 states: Folding, Not Folding, Might Fold, Opaque, and Unknown. When steps fold, transformations push back to the source.

Exam Intel: Query folding = pushing transforms to source. Check icons in the Dataflow web UI (not desktop!). Column Quality/Distribution/Profile for data profiling. CSV files can't fold.
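
As a rough illustration of what "pushing transforms to source" means, the toy Python model below folds consecutive steps into one SQL string until it meets a step it cannot translate; that step and everything after it run locally. The step kinds, the `fold_steps` helper and the SQL it builds are hypothetical, and this is not how the Power Query engine is implemented.

```python
# Toy model of query folding: NOT the Power Query engine, just the idea.
# Each step is (kind, argument). Only some kinds can be translated to SQL.
FOLDABLE = {"filter", "select"}

def fold_steps(table, steps):
    """Return (sql_sent_to_source, steps_left_to_run_locally)."""
    columns, predicates = "*", []
    local = []
    for i, (kind, arg) in enumerate(steps):
        if kind not in FOLDABLE:
            # First non-foldable step: it and everything after it run locally.
            local = steps[i:]
            break
        if kind == "select":
            columns = ", ".join(arg)
        elif kind == "filter":
            predicates.append(arg)
    sql = f"SELECT {columns} FROM {table}"
    if predicates:
        sql += " WHERE " + " AND ".join(predicates)
    return sql, local

steps = [
    ("filter", "Year = 2024"),
    ("select", ["Region", "Amount"]),
    ("custom_function", "CleanText"),  # hypothetical step that cannot fold
    ("filter", "Amount > 0"),          # foldable on its own, but comes too late
]
sql, local = fold_steps("Sales", steps)
print(sql)
print(local)
```

The practical takeaway is ordering: steps that can fold (filters, column selection) are best placed before any step that breaks folding.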

🔬

The Laboratory

Data scientists in the Notebook Lab run PySpark experiments. They broadcast small DataFrames to every node, use withColumn to engineer features, and maintain their Delta tables with regular VACUUM and OPTIMIZE cycles.

Exam Intel: PySpark key methods: broadcast(), .withColumn(), .cast(), df.summary(). Delta maintenance: VACUUM, OPTIMIZE, DESCRIBE HISTORY. Predict function works with Spark SQL and PySpark.
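
The idea behind broadcast() can be sketched without a Spark cluster: the small table is copied in full to every partition of the large one, so each partition joins locally and no fact rows are shuffled. The plain-Python model below is an illustration only; the partition layout, table contents and `join_partition` helper are hypothetical. In PySpark itself the hint is `pyspark.sql.functions.broadcast(small_df)` used inside a join.

```python
# Plain-Python model of a broadcast hash join (illustrative, not Spark itself).
# Large fact data is split into partitions, one per worker.
fact_partitions = [
    [("o1", "AU", 100), ("o2", "NZ", 50)],
    [("o3", "AU", 70), ("o4", "SG", 20)],
]
# Small dimension table: cheap to copy to every worker.
country_names = {"AU": "Australia", "NZ": "New Zealand"}

def join_partition(rows, lookup):
    """Inner join one partition against the broadcast copy of the lookup."""
    return [
        (order_id, lookup[code], amount)
        for order_id, code, amount in rows
        if code in lookup
    ]

# Each worker receives the whole lookup, so no fact rows move between workers.
joined = [
    row
    for part in fact_partitions
    for row in join_partition(part, country_names)
]
print(joined)
```

Order o4 drops out because "SG" has no match, which is the usual inner-join behaviour.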

🏛️

The Twin Vaults

Data reaches two great vaults. The Lakehouse welcomes all comers — structured, semi-structured, unstructured — with a flexible schema-on-read philosophy. The Warehouse, more selective, demands structured data and rewards it with full T-SQL power and multi-table transactions.

Exam Intel: Lakehouse = schema-on-read, Spark-first, Delta tables. Warehouse = schema-on-write, T-SQL, multi-table transactions. Lakehouse for data engineering/ML. Warehouse for traditional BI/analytics.

🔗

The Shortcut

A magical portal — the OneLake Shortcut — lets data appear in multiple places without being copied. Zero-copy access across workspaces, across lakehouses, even across clouds.

Exam Intel: Shortcuts = zero-copy data access. Work across workspaces and external sources. No data duplication. Appear as regular folders/tables in the lakehouse.

🏗️

The Architect's Workshop

The Semantic Model architect shapes raw tables into analytical gold. They choose storage modes wisely: Import for speed, DirectQuery for freshness, Direct Lake for the best of both worlds. Calculation groups reduce redundancy with SELECTEDMEASURE().

Exam Intel: Storage modes: Import (fastest), DirectQuery (live), Direct Lake (reads Delta directly). Calculation groups use SELECTEDMEASURE(). Precedence property controls combination order (higher = applied outermost). Only work with explicit measures.
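
The "higher = applied outermost" rule is easier to see with a toy model. In the Python sketch below, each calculation item is a function that wraps whatever stands in for SELECTEDMEASURE(), and items are combined in ascending precedence so the highest one ends up on the outside. The `DimDate` table, the two items and the `combine` helper are hypothetical; this is string manipulation that imitates the rule, not the DAX engine.

```python
# Toy model of calculation group precedence (illustrative only).
# A "measure" is a string; each calculation item wraps SELECTEDMEASURE().
def ytd(expr):
    return f"CALCULATE({expr}, DATESYTD(DimDate[Date]))"

def average(expr):
    return f"DIVIDE({expr}, COUNTROWS(DimDate))"

def combine(measure, items):
    """items: list of (precedence, wrapper). Higher precedence ends up outermost."""
    expr = measure
    # Wrap from lowest to highest precedence so the highest is applied last.
    for _, wrapper in sorted(items, key=lambda item: item[0]):
        expr = wrapper(expr)
    return expr

result = combine("[Sales]", [(20, ytd), (10, average)])
print(result)
```

With precedence 20 on the time-intelligence item and 10 on the average item, the average sits inside the year-to-date CALCULATE; swapping the two numbers reverses the nesting.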

🛡️

The Security Sentinels

Three sentinels guard the data. RLS filters rows with DAX expressions. CLS hides specific columns. OLS — the most powerful — makes entire tables and columns invisible, as if they never existed.

Exam Intel: RLS = DAX filter expressions + USERPRINCIPALNAME(). CLS = column-level hiding. OLS = requires Tabular Editor, cascading effect on dependent measures, objects completely invisible. Key difference: RLS filters data, OLS hides metadata.
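
Dynamic RLS comes down to one row filter evaluated per signed-in user. The Python sketch below imitates a role filter such as `[OwnerEmail] = USERPRINCIPALNAME()` on a small in-memory table. The table, the `OwnerEmail` column and the `rows_visible_to` helper are hypothetical, and the case-insensitive comparison is an assumption made to mirror DAX string comparison.

```python
# Toy model of dynamic RLS (illustrative; real RLS is a DAX filter on a role).
sales = [
    {"Region": "East", "OwnerEmail": "ana@contoso.com", "Amount": 120},
    {"Region": "West", "OwnerEmail": "ben@contoso.com", "Amount": 80},
    {"Region": "East", "OwnerEmail": "ana@contoso.com", "Amount": 40},
]

def rows_visible_to(user_principal_name, rows):
    """Similar in spirit to the DAX filter [OwnerEmail] = USERPRINCIPALNAME()."""
    upn = user_principal_name.lower()
    return [r for r in rows if r["OwnerEmail"].lower() == upn]

visible = rows_visible_to("ana@contoso.com", sales)
total_for_ana = sum(r["Amount"] for r in visible)
print(total_for_ana)
```

Every row still exists in the model and every column is still visible; only the rows returned differ per user. That is the contrast with OLS, which removes the object from the metadata altogether.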

🎨

The Gallery

Finally, data is displayed in the Gallery of Reports. The Performance Analyzer watches for slow visuals. Viewers browse with confidence, knowing Promoted items are team-approved, Certified items meet org quality standards, and Master data items are the organisation's single source of truth.

Exam Intel: Performance Analyzer separates DAX query time from render time. Endorsement: Promoted (team) → Certified (org quality) → Master data (single source of truth). Sensitivity labels propagate downstream. Reduce visuals per page for performance.

🔄

The Cycle Continues

Deployment pipelines move everything from Dev to Test to Prod. Git integration via PBIP format enables PR reviews. The XMLA endpoint opens the door to enterprise tools like Tabular Editor and DAX Studio.

Exam Intel: Deployment pipelines: default Dev→Test→Prod (supports 2-10 stages) with data source binding rules. Git: PBIP/PBIR format. XMLA endpoint for Tabular Editor, SSMS, scripted deployments. Impact analysis traces downstream dependencies.

Section 3 / Acronym Memory

Mnemonic Wall

Memorable acronyms and phrases to anchor key exam concepts in your memory.

🗺️

PRISM

Plan, pRepare, Implement (semantic), Scan (explore), Manage

The 4 exam domains; the P and the M bracket the single Plan & Manage domain. Prepare is the biggest slice at 40-45%.

💾

DILD

Direct Lake, Import, Live connection, DirectQuery

Storage and connection modes, with Direct Lake the newest. Live connection is a connection type rather than a storage mode.

🔒

ROC

RLS, OLS, CLS — "ROC solid security"

Security layers. RLS filters Rows, OLS hides Objects, CLS restricts Columns.

🪟

VOW

VAR/RETURN, ORDERBY, WINDOW

"I VOW to learn window functions." DAX window function building blocks.

🔢

SIX

SUMX, Iterators end in X, multi-column requires X

"The SIX rule: if you need multiple columns, add an X." Iterator function rule.
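
A plain-Python analogue may help fix the iterator idea: the expression is evaluated once per row (row context) and the per-row results are then aggregated. The table, the column names and the `sumx` helper below are hypothetical stand-ins for something like `SUMX(Sales, Sales[Quantity] * Sales[UnitPrice])`.

```python
# Plain-Python analogue of SUMX(Sales, Sales[Quantity] * Sales[UnitPrice]).
sales = [
    {"Quantity": 2, "UnitPrice": 10.0},
    {"Quantity": 1, "UnitPrice": 25.0},
    {"Quantity": 4, "UnitPrice": 2.5},
]

def sumx(table, expression):
    """Evaluate expression once per row (row context), then sum the results."""
    return sum(expression(row) for row in table)

revenue = sumx(sales, lambda row: row["Quantity"] * row["UnitPrice"])

# Aggregating each column separately and multiplying gives a different number,
# which is why a multi-column calculation needs the iterator.
wrong = sum(r["Quantity"] for r in sales) * sum(r["UnitPrice"] for r in sales)
print(revenue, wrong)
```

Here the row-by-row result is 55.0, while multiplying the two column totals gives 262.5.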

🚀

DTP

Dev, Test, Prod

Deployment pipeline stages in order.

🛠️

TABS

Tabular Editor, ALM Toolkit, Best Practice Analyzer, DAX Studio

The 4 key external tools for Power BI enterprise management.

📊

QDP

Column Quality, Distribution, Profile

Data profiling tools in Dataflows Gen2. Quality = % valid/error/empty. Distribution = distinct vs unique counts. Profile = min/max/statistics.
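
The distinct-versus-unique distinction is a common trip-up, so here is a small Python sketch of it: distinct counts how many different values appear, unique counts how many values appear exactly once. The `profile_column` helper is hypothetical. Treating `None` as "empty" and leaving it out of both counts is an assumption, error values are not modelled, and the input is assumed to be non-empty.

```python
from collections import Counter

def profile_column(values):
    """Return distinct and unique counts plus valid/empty percentages."""
    non_empty = [v for v in values if v is not None]
    counts = Counter(non_empty)
    total = len(values)
    return {
        "distinct": len(counts),                              # different values present
        "unique": sum(1 for n in counts.values() if n == 1),  # values seen exactly once
        "valid_pct": 100 * len(non_empty) / total,
        "empty_pct": 100 * (total - len(non_empty)) / total,
    }

stats = profile_column(["AU", "AU", "NZ", "SG", None])
print(stats)
```

In this sample there are 3 distinct values (AU, NZ, SG) but only 2 unique ones (NZ, SG), because AU repeats.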

🔧

VOCO

VACUUM, OPTIMIZE, COMPACT, describe histOry

Delta table maintenance commands. COMPACT is a reminder of what OPTIMIZE does (small-file compaction), not a separate command.

⭐

PCM

Promoted (team) → Certified (org quality) → Master data (single source of truth)

"PCM = Promoted → Certified → Master data." Three endorsement levels.

⚡

WISE

Where (filter), Identify patterns, Summarize (group), Extend (add columns)

KQL operators in logical order.

⚡

FLASH

Fallback triggers: RLS on tabLe, views (Auto-generated), capacity guard railS, Direct Lake on OneLake Has no fallback

Direct Lake fallback triggers.

🔄

RangeStart / RangeEnd

Incremental refresh parameters. Must be exact names. Query folding strongly recommended. No equality on both parameters. No IR for Direct Lake.
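
The "no equality on both parameters" rule exists so that a row sitting exactly on a partition boundary lands in one partition only. The Python sketch below compares a half-open filter (>= RangeStart and < RangeEnd) with a closed one; the partition dates and helper names are hypothetical.

```python
from datetime import datetime

# Two adjacent monthly partitions as (RangeStart, RangeEnd) pairs.
partitions = {
    "Jan": (datetime(2024, 1, 1), datetime(2024, 2, 1)),
    "Feb": (datetime(2024, 2, 1), datetime(2024, 3, 1)),
}
boundary_row = datetime(2024, 2, 1)  # falls exactly on a partition boundary

def half_open(row_date, start, end):
    """Equality on one side only: >= RangeStart and < RangeEnd."""
    return start <= row_date < end

def closed(row_date, start, end):
    """Equality on both sides: the pattern to avoid."""
    return start <= row_date <= end

def partitions_loading(row_date, keep):
    return [name for name, (start, end) in partitions.items()
            if keep(row_date, start, end)]

correct = partitions_loading(boundary_row, half_open)
duplicated = partitions_loading(boundary_row, closed)
print(correct, duplicated)
```

With the half-open filter the boundary row loads into Feb only; with equality on both sides it loads into Jan and Feb, so it is counted twice.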

👥

AMCV

Admin, Member, Contributor, Viewer

Workspace roles from most to least powerful.

🔑

BUILD

Build permission Unlocks: Investigate in Excel, Link to semantic models cross-workspace, Design composite models

Build permission capabilities.

Section 4 / Contrast Memory

Aspect	Shortcuts	Copy
Data duplication	No (zero-copy)	Yes
Storage cost	None	Double
Freshness	Always current	Stale until refreshed
Cross-workspace	Yes	Yes
External sources	Yes (ADLS, S3)	Via pipelines
Use when	Real-time access needed	Transformation required


Section 5 / Visual Grouping

The Cheat Sheet

A dense four-column reference grid — one column per exam domain.

Plan & Manage

10-15%

Workspace Roles

Admin > Member > Contributor > Viewer
Build Permission: Create reports, Analyze in Excel, composite models, cross-workspace access

Deployment Pipelines

Default: Dev → Test → Prod (supports 2-10 stages)
Rules for data source bindings
Impact analysis for downstream deps

Git Integration

PBIP/PBIR text format for PR reviews
Notebooks as source files (.py/.sql default, .ipynb via API)

XMLA Endpoint

Tabular Editor, SSMS
Table partitioning, scripted deployments
Enable for write operations

Governance

Sensitivity Labels: Public → General → Confidential → Highly Confidential
Propagate downstream, block export
Endorsed: Promoted → Certified → Master data (single source of truth)
F-SKU (Fabric), Premium. Shared = PBI only
Lineage: Source → Dataflow → Lakehouse → Model → Reports

Prepare & Serve

40-45%

OneLake & Shortcuts

One lake per tenant. ADLS Gen2. Unified namespace
Shortcuts: zero-copy, cross-workspace, cross-cloud (ADLS, S3)

Storage

Lakehouse: schema-on-read, Spark, Delta + Files, Z-Order, file-level security
Warehouse: schema-on-write, T-SQL, multi-table tx, cross-DB queries
Eventhouse: streaming/events, KQL, telemetry, logs, IoT
SQL Analytics Endpoint: auto-gen for lakehouses, read-only T-SQL

Data Movement

Data Factory: 170+ connectors, Copy Activity, full + incremental
Dataflows Gen2: Power Query Online, query folding (web UI icons!)
Profiling: Column Quality / Distribution / Profile

Query Folding

5 indicator states: Folding, Not Folding, Might Fold, Opaque, Unknown
Check in web UI only (not desktop)
CSV never folds. Strongly recommended for incremental refresh

Notebooks & Delta

PySpark + Spark SQL. broadcast() for small DFs
.withColumn(), .cast(). predict() in both languages
VACUUM, OPTIMIZE, DESCRIBE HISTORY, Z-Order

Semantic Models

20-25%

Storage Modes

Import: fastest, in-memory, scheduled refresh
DirectQuery: live, slower, no size limit
Direct Lake: reads Delta, best of both
Composite: mix modes in one model
Large format: >10GB or ANY XMLA writes

Direct Lake Fallback

Automatic (default), DirectLakeOnly, DirectQueryOnly
Triggers: RLS on table, views, capacity guardrails
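
How the three settings interact with a fallback trigger can be written as a small decision function. The sketch below is a plain-Python model: `resolve_query_mode` and its `needs_fallback` flag are hypothetical names, and the error type is an assumption standing in for the error the service returns.

```python
def resolve_query_mode(behavior, needs_fallback):
    """behavior: 'Automatic', 'DirectLakeOnly' or 'DirectQueryOnly'.
    needs_fallback: True when a trigger applies (for example, the table is a view).
    """
    if behavior == "DirectQueryOnly":
        return "DirectQuery"          # always DirectQuery, trigger or not
    if behavior not in ("Automatic", "DirectLakeOnly"):
        raise ValueError(f"unknown behavior: {behavior}")
    if not needs_fallback:
        return "Direct Lake"
    if behavior == "Automatic":
        return "DirectQuery"          # silent fallback
    # DirectLakeOnly: fallback is disabled, so the query fails instead.
    raise RuntimeError("Direct Lake cannot serve this query and fallback is disabled")

print(resolve_query_mode("Automatic", needs_fallback=True))
```

DirectLakeOnly is useful when a silent drop to DirectQuery performance would be worse than a visible error.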

DAX

VAR/RETURN: evaluated once, improves perf
Iterators (X): table + expression, row context
Window: INDEX, OFFSET, WINDOW + ORDERBY/PARTITIONBY
Info: ISBLANK, HASONEVALUE, ISINSCOPE, INFO.*
Calc groups: SELECTEDMEASURE(), precedence, explicit only
Field parameters: dynamic column/measure switching

Incremental Refresh

RangeStart/RangeEnd (exact names!)
Query folding strongly recommended. No equality on both
No IR for Direct Lake. Hybrid = Premium only

External Tools

Tabular Editor: OLS, calc groups, partitions
DAX Studio: query analysis
ALM Toolkit: deployments
BPA + VertiPaq Analyzer

Explore & Analyse

20-25%

Query Languages

T-SQL: warehouse + SQL analytics endpoint, joins, window funcs
KQL: where, summarize, render, extend. Eventhouse/real-time
DAX: EVALUATE + SUMMARIZECOLUMNS. CALCULATE for context
Visual Query Editor: no-code querying

Performance

Performance Analyzer: DAX query vs render time
Query Diagnostics: backend DQ/DL behaviour
Fewer visuals per page
Summary over detail
Dropdowns not lists for high cardinality
Disable unnecessary cross-filtering

Security in Reports

RLS tested with "View as" role
OLS makes fields disappear entirely
Sensitivity labels propagate to reports

Advanced

Aggregation tables: pre-summarised for large facts
ALL/ALLSELECTED/ALLEXCEPT: removing filters
Data Profiling: Quality, Distribution, Profile

Section 6 / Method of Loci

Memory Palace

Walk through five rooms, each representing an area of the exam.

The Lobby

Fabric Overview — Where your journey begins

🏢

OneLake

One lake per tenant, built on ADLS Gen2, OneDrive for data

👥

Workspace Roles

Admin > Member > Contributor > Viewer. Shared capacity = PBI only

🔑

Build Permission

Create reports from models, Analyze in Excel, composite models

📊

Capacity SKUs

F-SKU (Fabric), Premium P-SKU. F64, F128 for sizing

🏷️

Sensitivity Labels

Public, General, Confidential, Highly Confidential. Propagate downstream

⭐

Endorsement

Promoted (team) → Certified (org quality) → Master data (single source of truth)

The Data Lab

Data Preparation — Where raw becomes refined

🏭

Data Factory

170+ connectors, Copy Activity, pipeline orchestration

🧹

Dataflows Gen2

Power Query Online, query folding indicators (web UI only!)

📊

Data Profiling

Quality (valid/error/empty %), Distribution (distinct/unique), Profile (statistics)

🔗

OneLake Shortcuts

Zero-copy data access across workspaces and clouds

📓

Notebooks

PySpark: broadcast(), withColumn(), cast(). Delta: VACUUM, OPTIMIZE

🔄

Query Folding

5 states: Folding, Not Folding, Might Fold, Opaque, Unknown. Web UI only. CSV never folds

The Model Workshop

Semantic Models — Where data becomes meaning

⚡

Storage Modes

Import (fastest) → Direct Lake (reads Delta) → DirectQuery (live, slowest)

🔄

Direct Lake Fallback

Automatic/DirectLakeOnly/DirectQueryOnly. Triggers: RLS, views, guardrails

📐

DAX Iterators

End in X: SUMX, AVERAGEX. Need table + expression. Multi-column = must use X

🪟

Window Functions

INDEX (nth row), OFFSET (relative), WINDOW (range). All need ORDERBY
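
The three functions differ only in how they pick rows from an ordered table, which a few lines of Python can imitate. The helpers below are hypothetical analogues, not the DAX functions: they assume the rows are already sorted (the job of ORDERBY), use a 0-based `current` row position, and cover only positive INDEX positions and relative WINDOW bounds.

```python
# Plain-Python analogues of INDEX, OFFSET and WINDOW over an ordered table.
monthly = [("Jan", 100), ("Feb", 120), ("Mar", 90), ("Apr", 150)]  # already ordered

def index(rows, position):
    """INDEX: the row at an absolute 1-based position."""
    return rows[position - 1]

def offset(rows, current, delta):
    """OFFSET: the row `delta` positions from the current row, or None if outside."""
    target = current + delta
    return rows[target] if 0 <= target < len(rows) else None

def window(rows, current, start_delta, end_delta):
    """WINDOW: the rows between two relative positions, inclusive."""
    lo = max(current + start_delta, 0)
    hi = min(current + end_delta, len(rows) - 1)
    return rows[lo:hi + 1]

first_month = index(monthly, 1)              # first row
previous = offset(monthly, 2, -1)            # row before Mar
rolling = window(monthly, 2, -2, 0)          # Jan..Mar, relative to Mar
rolling_total = sum(amount for _, amount in rolling)
print(first_month, previous, rolling_total)
```

A rolling three-month total is the typical WINDOW use case; here the window ending at Mar sums to 310.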

🧮

Calculation Groups

SELECTEDMEASURE(), precedence property, explicit measures only

📈

Incremental Refresh

RangeStart/RangeEnd exact names. Query folding strongly recommended. No IR for Direct Lake

The Security Vault

Security & Governance — Where trust is enforced

🔒

RLS

DAX filter expressions. USERPRINCIPALNAME(). Filters rows only. Dynamic per user

🔐

OLS

Tabular Editor ONLY. Hides entire objects. Cascading: dependent measures vanish

🛡️

CLS

Column-level hiding. Can't protect measures or tables (use OLS for that)

📋

XMLA Endpoint

Enterprise management. Tabular Editor, SSMS. Enable large format for writes

🚀

Deployment Pipelines

Dev → Test → Prod. Data source binding rules. Impact analysis

🔀

Git Integration

PBIP/PBIR format. PR reviews. Notebooks as source files (.py/.sql)

The Observatory

Explore & Analyse — Where insights are discovered

🔍

Performance Analyzer

DAX query time vs render time. Identifies slow visuals

📊

T-SQL

Warehouse + SQL Analytics Endpoint. Visual Query Editor for no-code

⚡

KQL

where, summarize, render, extend. For eventhouse real-time data

🛠️

External Tools

Tabular Editor, DAX Studio, ALM Toolkit, BPA, VertiPaq Analyzer

📉

Report Optimisation

Fewer visuals, summary over detail, dropdowns for high cardinality

🧊

Aggregation Tables

Pre-summarised facts for composite model performance

Section 7 / Pattern Recognition

Pattern Spotter

Decision flowcharts and trigger-answer pattern cards for common exam questions.

Which Data Store?

What type of data?
  ├── Streaming / Events / IoT → Eventhouse (KQL)
  ├── Unstructured / Semi-structured → Lakehouse
  ├── Need multi-table transactions?
  │   ├── Yes → Warehouse
  │   └── No → Need Spark / ML?
  │       ├── Yes → Lakehouse
  │       └── No → Need T-SQL?
  │           ├── Yes → Warehouse
  │           └── Either works → Lakehouse (more flexible)
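
The tree above can also be read as a chain of early returns, which makes the order of the questions explicit. The function below is a sketch of that reading; `choose_data_store` and its parameter names are hypothetical, and it encodes only the branches shown in the flowchart.

```python
def choose_data_store(data_type, multi_table_tx=False,
                      needs_spark_ml=False, needs_tsql=False):
    """Walk the flowchart top to bottom.

    data_type: 'streaming', 'unstructured', 'semi-structured' or 'structured'.
    """
    if data_type == "streaming":
        return "Eventhouse"
    if data_type in ("unstructured", "semi-structured"):
        return "Lakehouse"
    if multi_table_tx:
        return "Warehouse"
    if needs_spark_ml:
        return "Lakehouse"
    if needs_tsql:
        return "Warehouse"
    return "Lakehouse"  # either works; the flowchart prefers the more flexible option

print(choose_data_store("structured", multi_table_tx=True))
```

Because multi-table transactions are checked before Spark/ML, a workload that needs both resolves to Warehouse in this reading of the tree.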

Which Storage Mode?

Where is the data?
  ├── OneLake Delta tables → Direct Lake (default for Fabric)
  │   └── Need guaranteed no fallback? → DirectLakeOnly setting
  ├── External source, needs to be live → DirectQuery
  ├── Small-medium, can schedule refresh → Import (fastest performance)
  └── Mix of sources / needs → Composite Model

Which Security Layer?

What do you need to restrict?
  ├── Which rows users see → RLS (DAX expressions)
  ├── Which columns users see → CLS (column hiding)
  ├── Hide entire tables/columns from existence → OLS (Tabular Editor)
  ├── Classify data sensitivity → Sensitivity Labels
  └── File-level in lakehouse → OneLake RBAC

Which External Tool?

What do you need to do?
  ├── Configure OLS → Tabular Editor (only option!)
  ├── Create calculation groups → Tabular Editor (or PBI Desktop)
  ├── Analyse DAX query performance → DAX Studio
  ├── Compare/deploy models between environments → ALM Toolkit
  ├── Check model best practices → Best Practice Analyzer
  └── Analyse model storage/compression → VertiPaq Analyzer

Trigger → Answer Patterns

"zero-copy" or "no data duplication"

→ OneLake Shortcuts

"schema-on-read"

→ Lakehouse

"multi-table transactions"

→ Warehouse

"streaming" or "telemetry" or "IoT"

→ Eventhouse

"objects completely hidden" or "as if deleted"

→ OLS (Object-Level Security)

"Tabular Editor required"

→ OLS configuration

"SELECTEDMEASURE()"

→ Calculation Groups

"RangeStart / RangeEnd"

→ Incremental Refresh

"fallback to DirectQuery"

→ Direct Lake on SQL endpoints

"query folding required"

→ Incremental Refresh

"single source of truth"

→ Master data endorsement

"propagate downstream"

→ Sensitivity Labels

"VACUUM or OPTIMIZE"

→ Delta Table Maintenance

"broadcast()"

→ Small DataFrame optimisation in PySpark

"precedence property"

→ Calculation Groups

"PBIP or PBIR format"

→ Git Integration / version control

"impact analysis"

→ Deployment Pipelines / Lineage

"enable large format even for small models"

→ XMLA write operations


Keith Oak

Director & Principal Consultant — Lucid Labs

Microsoft Solutions Partner architect specialising in Fabric, Azure Data & AI, and GitHub Enterprise. 18+ years delivering data platforms for Australian businesses — building the systems these exams test every day.

lucidlabs.com.au · Published 29-03-2026