Pinterest launched a next-generation CDC-based database ingestion framework using Kafka, Flink, Spark, and Iceberg. The system reduces data availability latency from 24+ hours to 15 minutes, processes ...
Update implements Jakarta EE 11 platform and brings support for Jakarta Data repositories and virtual threads.
A demonstration of GPU acceleration benefits in Apache Spark workloads using NVIDIA RAPIDS. This project provides measurable performance improvements through real-world machine learning and data ...
AWS Managed Kafka and Apache Kafka, a distributed event streaming platform, has become the de facto standard for building real-time data pipelines. However, ingesting and storing large amounts of ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results