Direct Lake Mode Best Practices¶

Overview¶

Use V-Order: This optimizes the physical layout of your data for faster querying ¹
Merge small files: Regularly compact Parquet files to improve query performance and merge changes with OPTIMIZE commands ²
Compress data: Use compression methods like Snappy for better performance
Keep the Delta log minimal: Minimize the effect of data updates on the Delta log

Include only necessary columns: Remove unnecessary columns to reduce storage size and loading time, even if they don't affect query performance directly
Avoid views and bidirectional relationships: Views can force a fallback to DirectQuery, and many-to-many or bidirectional relationships are not performant
Pre-aggregate data: Reduce the data load by aggregating data before it is loaded into the model

Write efficient DAX: Filter only required columns in your DAX measures instead of using ALL(Table)
Manage data refresh: Control when your data is updated by disabling automatic propagation if you need to refresh the entire semantic model at once. Use manual or programmatic refreshes to ensure your model is in a consistent state
Use pure Direct Lake mode for authoring: Test your model in pure Direct Lake mode to ensure maximum performance before considering fallback options in production

Distribute reports using apps: Do not grant end-users direct workspace access. Instead, share reports and data through a Power BI app, as recommended in the Microsoft documentation