120 GB from Kafka to ClickHouse in 4 minutes

120 GB from Kafka to ClickHouse in 4 minutes
Megaladata performance test on a real-world ETL workflow

Recently, we conducted a performance test of Megaladata using a realistic ETL workflow – without synthetic benchmarks, simplified datasets, or artificial optimizations.
 

Test setup

  • Data source: Apache Kafka
  • Data volume: 120 GB
  • Message format: JSON
  • Target database: ClickHouse
  • Operating system: Linux
  • Megaladata Server: 24 CPU cores | 64 GB RAM
  • Platform version: Megaladata 7.3.0
     

Workflow

The workflow is as close to a real production scenario as possible:

  • JSON messages consumed from Apache Kafka
  • Parsing into a flat table
  • Lightweight data transformations
  • Statistics calculation
  • Loading results into ClickHouse

In other words, a typical stream → transform → analytical storage (ETL) pipeline that many teams run every day.

 

Result

  • Total execution time: 4 minutes 2 seconds

120 GB of data was processed from Kafka to ClickHouse – including parsing, transformations, and calculations.

No Spark cluster. No manual JVM tuning. No complex code.
 

A video demonstrating the full execution of the test is available below.

 

Why this matters

ETL platform performance is often tested:

  • On small datasets,
  • Without real transformations,
  • Or in laboratory conditions far from production.

Here, this is a realistic, honest scenario that can be easily transferred to real systems:

  • Streaming analytics
  • Logging
  • Telemetry
  • Event processing
  • Transactional data streams
  • IoT
  • Finance
     

What this shows in practice

  • Megaladata comfortably processes tens and hundreds of gigabytes of data
  • Suitable not only for analytics, but also for heavy ETL / streaming workloads
  • Low-code does not mean “slow”
  • The platform scales vertically and uses hardware efficiently
     

Conclusion

Megaladata is not just a “visual Excel for analysts.” It is a full-featured, high-performance data processing platform that can compete with traditional ETL stacks while remaining simple to use.
 

If you need to quickly ingest data from Kafka, transform it, calculate metrics, and load it into ClickHouse or another analytical database, this test speaks for itself.
 

A video of the test execution:

See also

How data analysis (still) influences the decision-making process in 2026
How data analysis (still) influences the decision-making process in 2026
In 2026, the phrase "data-driven" feels outdated. With the rapid evolution of agentic AI, autonomous software agents are no longer just analyzing charts; they are executing complex workflows, triggering...
When to Avoid AI in Business Analytics: Counter-Examples
When to Avoid AI in Business Analytics: Counter-Examples
While many resources explain how to use artificial intelligence (AI) in business analytics, it is just as important to recognise the tasks where AI, especially large language models (LLMs), is either...
Megaladata at e-Logi Event
Megaladata at e-Logi Event
On May 23, Megaladata was invited to participate in e-Logi Networking Day, a professional event hosted by the Armenian Logistics Association.

About Megaladata

Megaladata is a low code platform for advanced analytics

A solution for a wide range of business problems that require processing large volumes of data, implementing complex logic, and applying machine learning methods.
GET STARTED!
It's free