Skip to main content
Data & Analytics ⭐ Premium ⭐ Featured

AWS Certified Data Engineer Associate (DEA-C01)

By Webmaster Certland English 📝 340 questions ❤️ 0 likes

Practice exam for the AWS Certified Data Engineer Associate (DEA-C01). Covers data ingestion and transformation, data store management, data operations and support, and data security and governance.

⭐ Premium Updated Mar 2026

Unlock all 340 AWS Certified Data Engineer Associate (DEA-C01) questions

Full simulation · Detailed explanations · Unlimited attempts

  • 340 questions — ~5 full-length simulations
  • Detailed explanations — why each answer is right or wrong
  • Unlimited attempts — retake as many times as needed
  • Smart Practice + Focus Mode + no ads
340
Questions
All certifications
from $4.90/mo

Sample Questions — AWS Certified Data Engineer Associate (DEA-C01)

5 free sample questions from this practice exam. Correct answers are highlighted.

1. A data engineer needs to ingest clickstream events from a website into AWS for real-time processing. The application generates approximately 5,000 records per second, and each record is 2 KB in size. The engineer needs to determine the minimum number of shards required for an Amazon Kinesis Data Stream. How many shards are needed?

A 5 shards
B 10 shards ✓ Correct
C 20 shards
D 2 shards

2. A company is using Amazon Kinesis Data Streams to ingest sensor data. The default data retention period is not long enough for their recovery requirements. A data engineer needs to retain records in the stream for 10 days to allow replay in case of downstream failures. What is the correct action?

A Rely on the default 7-day retention period, which already covers 10 days
B Enable server-side encryption on the stream to extend the retention period
C Modify the stream's retention period to 10 days using the UpdateStreamMode API or console ✓ Correct
D Configure the stream to automatically archive data to Amazon S3 after 24 hours

3. A data engineer is setting up an Amazon AppFlow flow to transfer sensitive customer PII data from Salesforce to Amazon Redshift. The security team requires that all data in transit and at rest must be encrypted with customer-managed keys. The flow must also mask specific fields (such as credit card numbers) before loading into Redshift. Which combination of AppFlow features satisfies both requirements?

A Enable SSL/TLS for the AppFlow connection and use an AWS Glue job to mask fields before loading to Redshift
B Configure Amazon S3 SSE-S3 encryption for the intermediate storage and attach a Lambda function to mask fields in transit
C Configure AppFlow to use an AWS KMS customer-managed key for encryption and use AppFlow's built-in field masking transformation before delivery to Redshift ✓ Correct
D Enable Amazon Macie on the AppFlow destination to detect and mask PII fields automatically after delivery

4. A data engineer is building an AWS Glue job that reads from Amazon DynamoDB and writes to Amazon S3. The DynamoDB table has 50 million items and is heavily used by production applications during business hours. The engineer wants the Glue job to read all 50 million items without impacting production performance. What is the most appropriate approach?

A Read directly from the DynamoDB table using a Glue DynamicFrame with 2 DPUs to limit read throughput
B Export the DynamoDB table to Amazon S3 using DynamoDB Export to S3, then read the export files with Glue ✓ Correct
C Enable DynamoDB Streams on the table and read the full table history via a Glue streaming job
D Enable DynamoDB auto-scaling before the Glue job to provision extra read capacity for the export

5. A data engineering team is designing a new Amazon Redshift cluster to store sales transaction data. The fact table has 500 million rows and is frequently joined to a dimension table that has only 200 rows. Which distribution style should the team use for the dimension table to minimize data movement during joins?

A KEY distribution on the primary key of the dimension table
B ALL distribution to replicate the table to every node ✓ Correct
C EVEN distribution to spread rows uniformly across slices
D AUTO distribution and let Redshift decide

Want to test yourself for real?

Create a free account and run our exam simulation engine.

Free No credit card
  • Simulation engine
  • Up to 10 questions per attempt
  • Score & basic stats
Create free account Already have an account? Sign in
Best
Premium Premium
  • All 340 questions
  • Detailed explanations
  • Smart Practice + Focus Mode
⭐ Get Premium

Information

Questions 340
Time 2h 10min
Difficulty Medium
Minimum Score 72.00%


💰 ROI

Official exam $150.00
CertLand $4.90/mo
Prepare for $150 for less than a coffee/mo

Study Guides & Articles

Related Exams

Discussion

No comments yet. Be the first to start the discussion!

Sign in to join the discussion.