Sharing the architecture of DevLake, an R&D performance data integration platform

陈映初

Chinese Session 2022-07-29 14:10 GMT+8  #integration

DevLake is an open-source R&D data platform that provides automated, one-stop data collection, analysis, and visualization to help R&D teams better understand their development process, identify key bottlenecks, and improve efficiency. One of the biggest challenges in developing a multi-source integration platform like DevLake is the complexity of the data sources and the sheer volume of data. The original architecture had three issues: data loss, data distortion, and the need to request the API again after every adjustment. This presentation shares how the team overcame these challenges as the architecture evolved, and aims to provide a reference for designing data integration and processing frameworks.

Speakers:


Yingchu Chen: Software Development Engineer at Merico, currently responsible for back-end development, mainly maintaining Apache/Incubator-Devlake. Previously took part in the Southwest Final of the China Chuangyi Cultural Innovation Competition.