History and technical analysis of the real-time data warehouse Apache Doris

Dongjin Zhang

Chinese Session 2022-07-31 10:30 GMT+8  #keynote

Apache Doris is an emerging open source real-time data warehouse project in recent years, featuring high performance, complete scenario support, ease of use and easy operation and maintenance. This article will provide an overview of the project’s development within Baidu and after it was open sourced, as well as an in-depth analysis of its core features and technical implementation, in the hope that it will help more people to understand, use and participate in this excellent project.

Speakers:


Dongjin Zhang is a distinguished architect of Baidu, the head of Baidu Big Data Infrastructure, and the head of Baidu PALO team. He has led the development of Baidu's earliest big data platform LSP (massive log analysis platform), and is now fully responsible for Baidu's big data infrastructure product system, including the ultra-large scale offline computing service EMR+, high performance stream computing service SC, and the new in-offline converged unified lake warehouse PALO.