
What technologies are used in the Data Cloud architecture?

  • Apache Parquet | Columnar storage file format
  • Apache Iceberg | Table format
  • Apache Airflow | Segmentation workflow
  • Apache Spark | Analytics and ML
  • Trino (formerly PrestoSQL) | Distributed SQL
  • AWS S3 | Cold storage
  • AWS DynamoDB | Hot storage
  • AWS EMR | Distributed compute
  • Spark is used in Data Cloud segmentation processing. More info here.
  • Spark distcp is used in the activation egress step. More info here.
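
To make the segmentation item above more concrete, here is a minimal sketch of how a segment could be expressed as a SQL query, since both Spark and Trino accept SQL. This is an illustration under stated assumptions, not how Data Cloud actually implements segmentation: the `SegmentRule` class, the `build_segment_query` helper, and the table and column names (`lake.crm.profiles`, `profile_id`, `total_spend`) are all hypothetical.

```python
import re
from dataclasses import dataclass

# Plain identifiers, optionally dot-qualified (catalog.schema.table).
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*)*$")


@dataclass(frozen=True)
class SegmentRule:
    """Hypothetical segment definition: rows where `column` >= `min_value`."""
    column: str
    min_value: float


def build_segment_query(table: str, id_column: str, rule: SegmentRule) -> str:
    """Return a SELECT that lists the ids of rows matching the rule."""
    # Reject anything that is not a simple identifier, so user input
    # cannot smuggle extra SQL into the query text.
    for name in (table, id_column, rule.column):
        if not _IDENTIFIER.match(name):
            raise ValueError(f"unsafe identifier: {name!r}")
    # float() rejects non-numeric thresholds before they reach the SQL text.
    threshold = float(rule.min_value)
    return (
        f"SELECT {id_column} FROM {table} "
        f"WHERE {rule.column} >= {threshold!r}"
    )


# Hypothetical segment: profiles with a total spend of at least 100.
query = build_segment_query(
    "lake.crm.profiles", "profile_id", SegmentRule("total_spend", 100)
)
print(query)
```

In a Spark job, the resulting string could be passed to `spark.sql(query)` on an existing `SparkSession`; the same text could also be submitted to Trino. Which engine runs a given segment in Data Cloud is not stated in this FAQ.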

Diagram