How to boost a cloud-native lakehouse with 2x performance

史少锋

Chinese Session 2023-08-19 13:30 GMT+8  #datalake

With the integration of cloud computing and big data technology, cloud-native lake warehouse has gradually become a hot spot in the industry. Kubernetes and Apache Spark, as open source technology leaders in cloud native and big data, combine to become the first choice of many people. However, using Spark as a lake analysis engine on Kubernetes faces many challenges, such as resource management, task scheduling, storage interconnection, elastic scalability, and high reliability. Kyligence has accumulated a wealth of experience in serving a variety of clients, and actively participates in and gives back to the community; This talk will share how to build a cloud-native lake analytics engine out of the box based on a range of open source technologies such as K8S, Spark, Gluten, Volcano, Kyuubi, and more.

Speakers:


Shi Shaofeng: Kyligence Inc, Chief architect, Shaofeng Shi, Chief Architect, Kyligence; Apache Kylin committer & PMC. He has worked for eBay and IBM, and as a core member, participated in the R&D and open source process of Apache Kylin project. He has long-term in-depth research and rich practical experience on the development trend of big data and cloud computing industry, industrial ecology and technology productization.