This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which format defines infrastructure using YAML? A) ARM B) CloudFormation C) Terraform D) All Correct Answer: D Explanation ARM, CloudFormation, and Terraform are tools for defining infrastructure as code. ARM (Azure Resource Manager) and CloudFormation (AWS) support JSON and YAML for declarative resource definitions. Terraform uses HCL…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Terraform is a tool for: A) Monitoring B) Orchestration C) Infrastructure as Code D) Backup Correct Answer: C Explanation Terraform is an Infrastructure as Code (IaC) tool that defines, provisions, and manages cloud infrastructure using declarative configuration. It supports multiple cloud providers, enabling a unified way to manage…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which service is a “serverless” type? A) EC2 B) Lambda C) EMR D) Databricks Correct Answer: B Explanation AWS Lambda is a serverless compute service that runs code without server management. You pay only for execution time, and Lambda automatically scales based on request volume. It is…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Azure Data Factory is: A) Storage B) ETL orchestrator C) Visualization tool D) Scheduler Correct Answer: B Explanation Azure Data Factory (ADF) is a cloud ETL and ELT service for orchestrating data flows. It allows extracting data from various sources, transforming it, and loading it into target…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: In AWS, data flows are often processed using: A) Lambda B) S3 C) Glue D) All of the above Correct Answer: D Explanation In AWS, data flows are processed using a combination of several services. S3 serves as a data storage layer, Lambda enables serverless and event-driven processing, and Glue provides ETL…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: “Which cloud provides BigQuery?: A) AWS B) Azure C) GCP D) Oracle Correct Answer: C Explanation BigQuery is a fully managed serverless data warehouse service provided by Google Cloud Platform (GCP). It enables large-scale data analysis using SQL…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: “Data Lakehouse” combines features of: A) Lake + Warehouse B) OLTP + OLAP C) Kafka + Spark D) Hadoop + Hive Correct Answer: A Explanation A Data Lakehouse is a modern data architecture that integrates the strengths of a Data Lake and a Data Warehouse. From the Data Lake it inherits the ability to…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Delta Lake adds to Parquet files: A) Schema B) Transactions C) Metadata D) All of the above Correct Answer: D Explanation Delta Lake is an open-source storage layer that turns plain Parquet files into a robust transactional data lake with ACID properties. Delta Lake adds four key layers on top…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which Airflow component defines dependencies? A) Task B) DAG C) Operator D) Hook Correct Answer: B Explanation A DAG (Directed Acyclic Graph) is the core building block of Apache Airflow that defines the entire workflow including all tasks and their dependencies. A DAG is an acyclic…
This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which of these systems is a batch engine? A) Apache Spark B) Kafka C) Flink D) RabbitMQ Correct Answer: A Explanation Apache Spark is primarily a batch processing engine that has become the de facto standard for large-scale data processing in distributed environments. Unlike streaming systems, Spark processes data in…