Our vision:

Provide a universal data processing abstraction that simplifies and optimizes data analytics across diverse platforms.

Team Databloom

Who is Databloom?

We are a software company that is building the future of data processing with AI. Our flagship product, Blossom Sky, enables decentralized data processing on the edge, allowing data scientists and analysts to train large AI models directly at the source. Blossom Sky leverages Apache Wayang, an open source API for cross-platform data processing that we are actively contributing to. With Blossom Sky, users can run data models and ML training against a variety of decentralized data sources ranging in size from gigabytes to petabytes, while ensuring data privacy and compliance.
In 2021, Jorge and Alexander teamed up to create a startup that would bring Jorge’s research to market. Their goal was to develop the most comprehensive federated learning platform by bootstrapping their research and development efforts. After leaving stealth mode, they donated their project to the Apache Software Foundation, which accepted it as Apache Wayang (incubating).
Databloom was founded and established operations in Miami, pioneering 100% remote work together with a four-day workweek. Out of more than 4,000 startups evaluated, Databloom placed in the Top 50 of the famous Pepperdine "Most Fundable Companies" competition. Databloom has also been highlighted at several international conferences.
Databloom has been selected for the MARL 5G accelerator spring cohort. The program provides selected startups with mentorship, training and access to deep tech investors, corporates and public institutions. Databloom also received its first external investment from MARL 5G, which will help the company grow its product development and customer acquisition efforts.

The whole story

Alexander first came into contact with Jorge in 2012, during his time at Cloudera. The research papers written by Jorge and others on his team at QCRI and HPI led to research on distributed in-memory query engines and, eventually, to the first in-memory query engine for Apache Hadoop.
Jorge started to investigate the topic of Federated Data Processing and distributed AI in 2015. The team around Jorge developed Rheem in 2016, and presented the software stack at the Spark Summit 2017, followed by multiple conferences.
Alexander met Jorge again in 2019 during a visit to Qatar, where he was a guest of the Qatar Technology Foundation. After that first meeting, both realized the huge potential of the technology and agreed to found a company to bring it to market. From that point on, they bootstrapped further development to build the most complete federated data processing engine.
DataBloom AI, Inc. was founded in Miami in 2022 to serve the growing interest from the Bay Area, Florida and Texas.

Members of our team are frequent speakers at large conventions and meetups such as newWork summit, SXSW, Big Data World, ApacheCon, BOSS and Developer Week.


Our Team

Put a face to the name

Alexander Alten

CEO & co-founder

Entrepreneur, developer and big data expert with more than 30 years of data experience. Operating at the crossroads of sales and computer science to deliver solutions that people remember.

Dr. Jorge Quiané

CTO & co-founder

Principal Investigator of Rheem, now Apache Wayang. Passionate about building next-generation Big Data infrastructures. Associate Professor, IT University of Copenhagen.

Jessica Liu, PhD

CFO

Passionate about Operational Efficiency and Strategic Planning. PhD in Economics, and Expert in Analytics and Modeling. Drives financials with passion and structure.

Vatsal Shah

VP Growth

Roboticist turned ML Engineer turned growth hacker. Always curious, often wrong. Years of experience in sales, growth hacking, Federated Learning and AI.

Dr. Zoi Kaoudi

VP Data Science and AI

Key scientist behind Apache Wayang. Researches ML and AI, and is an enthusiast of true AI and generative AI. Inventor of the data fabric concept. Associate Professor, IT University of Copenhagen.

Dr. Kaustubh Beedkar

VP FLDEV & co-founder

One of the top researchers in data regulation automation and an enthusiastic open source developer. Passionate about query processing frameworks and edge AI training.

Thomas Lyu

VP Sales, South Korea

IT consultant and storage specialist with more than 20 years of experience in sales engineering and sales in South Korea. Exceptional thinker and customer-oriented sales project lead, always putting our customers first.

Our Advisory Board

Prof. Dr. Begüm Demir

Technische Universität Berlin

Dr. Begüm Demir is a Scientific Committee member of the SPIE International Conference on Signal and Image Processing for Remote Sensing and co-chair of Image and Signal Processing for Remote Sensing.

Prof. Dr. Volker Markl

Technische Universität Berlin

Dr. Volker Markl is director of the Berlin Institute for the Foundations of Learning and Data (BIFOLD). Volker has mentored, advised and co-founded multiple startups and is recognized as an ACM Fellow for his research.

Get Started

Want to get started on your own? Apache Wayang is open source and ready for you to start building your federated data processing engine.
Get Apache Wayang