Skip to main content
Data ⭐ Premium

Databricks Certified Machine Learning Professional - 340 Questions

By Webmaster Certland English ❤️ 0 likes

Practice exam for the Databricks Certified Machine Learning Professional certification. Covers Model Development with SparkML, MLflow, and Feature Store; MLOps with lifecycle management, drift detection, and automated retraining; and Model Deployment with Mosaic AI Model Serving.

⭐ Premium Updated Mar 2026

Unlock all 340 Databricks Certified Machine Learning Professional - 340 Questions questions

Full simulation · Detailed explanations · Unlimited attempts

  • 340 questions — ~5 full-length simulations
  • Detailed explanations — why each answer is right or wrong
  • Unlimited attempts — retake as many times as needed
  • Smart Practice + Focus Mode + no ads
340
Questions
All certifications
from $4.90/mo

Sample Questions — Databricks Certified Machine Learning Professional - 340 Questions

5 free sample questions from this practice exam. Correct answers are highlighted.

1. A data scientist is building a SparkML pipeline that includes StringIndexer, OneHotEncoder, VectorAssembler, and LogisticRegression. The pipeline must be reusable for both training and scoring. Which approach correctly constructs this pipeline?

A Create a Pipeline with stages=[indexer, encoder, assembler, lr] and call pipeline.fit(train_df) to produce a PipelineModel ✓ Correct
B Create a Pipeline and call pipeline.addStage(indexer).addStage(encoder).addStage(assembler).addStage(lr) then fit()
C Use Pipeline.create(indexer, encoder, assembler, lr).fit(train_df) to build the pipeline
D Fit each stage individually and combine the fitted models into a PipelineModel manually

2. A team building a recommendation engine needs to handle implicit feedback data using collaborative filtering in SparkML. Which configuration of the ALS estimator is most appropriate for implicit feedback?

A Use ALS with implicitPrefs=False and convert all interactions to a 1-5 rating scale
B Use ALS with implicitPrefs=True and tune the alpha parameter to control confidence levels ✓ Correct
C Use ALS with feedbackType='implicit' and set the confidence parameter
D Convert implicit feedback to binary ratings and use ALS with default settings

3. A data scientist needs to encode a categorical column with over 1,000 unique values for use in a SparkML model. Which approach avoids creating an excessively wide feature vector while preserving categorical information?

A Apply StringIndexer followed by OneHotEncoder to create a full one-hot representation
B Use Bucketizer to group the categories into a smaller number of bins
C Use StringIndexer to convert categories to indices and feed them directly into a tree-based model ✓ Correct
D Apply a HashingTF with numFeatures=10 to reduce dimensionality of the categories

4. A company's ML platform team wants to standardize feature scaling across all numeric features before training a LogisticRegression model. The features have different units and ranges. Which SparkML transformer should they use after VectorAssembler?

A StandardScaler with withMean=True and withStd=True after VectorAssembler ✓ Correct
B MinMaxScaler with min=0 and max=1 after VectorAssembler
C Normalizer with p=2 after VectorAssembler
D MaxAbsScaler after VectorAssembler

5. A data scientist wants to perform hyperparameter tuning for a GBTClassifier using cross-validation in SparkML. They need to search over maxDepth values of [3, 5, 7] and maxIter values of [50, 100, 200]. How many total models will be trained with 5-fold cross-validation?

A 9 models
B 45 models ✓ Correct
C 27 models
D 135 models

Want to test yourself for real?

Create a free account and run our exam simulation engine.

Free No credit card
  • Simulation engine
  • Up to 10 questions per attempt
  • Score & basic stats
Create free account Already have an account? Sign in
Best
Premium Premium
  • All 340 questions
  • Detailed explanations
  • Smart Practice + Focus Mode
⭐ Get Premium

Related Exams

Discussion

No comments yet. Be the first to start the discussion!

Sign in to join the discussion.