Apache Incubator Track

[ Tracks | Register and Attend ]
Tuesday 16:15 UTC
How to Slide Your Release Pass the Incubator
Justin Mclean

All podling releases need to be voted on by the incubator PMC before being released to the world. I'll go through what the incubator PMC looks for in every release and what you can do to make it pass that IPMC vote and get your project one step closer to graduation. More importantly I'll cover where you can get help if you need it. In this talk, I'll describe current incubator and ASF policy, recent changes that you may not be aware of, and go into detail the legal requirements of common open source licenses and the best way to assemble your NOTICE and LICENSE files. Where possible I describe the reasons behind why things are done a certain which may not always be obvious from our documentation. I'll show how I review a release and the simple tools I use. I'll go through a worked example or two, including a fictional project called Apache Wombat, and cover common mistakes I've seen in releases.

Justin Mclean has more than 25 years’ experience in developing web-based applications and is heavily involved in open source hardware and software. He runs his own consulting company Class Software and has spoken at numerous conferences in Australia and overseas. In his free time, he's active in several Apache Software Foundation projects, including the Apache Incubator, and is a mentor for a number of their projects. He's currently the chair of the Apache Incubator and on the ASF board. He also teaches at an online college and runs the IoT meetup in Sydney.

Tuesday 16:55 UTC
Apache IoTDB: Growing a bilingual community
Julian Feinauer

Open Source in general but also the ASF gets more and attention in Asia and especially in China. Many projects with initial chinese communities joined the incubator in the last years. This is a very positive development but over the last years we experienced that chinese communities often have different needs and challenges when learning to adopt to the Apache Way. One very important example is the language barrier which is present as many chinese developers are not fluent in english or not as fluent as most developers from western countries. We feel that the IoTDB community was really successful in adopting the Apache Way and in this talk we want to share our approaches and our learnings.

Julian Feinauer joined the IoTDB Community as one of the first external Contributors.

Tuesday 17:35 UTC
Past, now and future about Apache YuniKorn (incubating): Cloud-Native resource scheduler
Wilfred Spiegelenburg Wangda Tan

Apache YuniKorn (Incubating) is a light-weight, universal resource scheduler for container orchestrator systems. It was created to achieve fine-grained resource sharing for various workloads efficiently on a large scale, multi-tenant, and cloud-native environment. YuniKorn brings a unified, cross-platform, scheduling experience for mixed workloads that consist of stateless batch workloads and stateful services. YuniKorn now supports K8s and can be deployed as a custom K8s scheduler. YuniKorn's architecture design also allows adding different shim-layer and adapt to different ResourceManager implementation including Apache Hadoop YARN, or any other systems. For this talk, we will talk about gaps in resource scheduling in Cloud-Native environment, and how YuniKorn can support running big data applications (like Spark/Flink/Tensorflow, etc.) on K8s. We will talk about existing and upcoming features of YuniKorn (including hierarchical of queues, resource fairness, gang scheduling support, integration with K8s features, quota management, autoscaling, etc.). We will also share how YuniKorn being used in community partners such as Alibaba, Cloudera, Lyft.

Wilfred is a Staff Software Engineer from Cloudera in Australia. He’s also PMC member of Apache YuniKorn (incubating), Apache Hadoop committer. He has worked on Hadoop for 6 years mainly on YARN, MapReduce, and Spark. Before Cloudera, he has worked for SUN Microsystems and Oracle as part of the Identity Management teams as a developer and consultant for over 10 years. Wilfred started his career as a lecturer at the Amsterdam University of Applied Science; teaching, designing, and implementing multiple IT systems. Wilfred holds a Master's in Decision Support Systems from Sunderland University. Wangda Tan is Sr. Manager of Compute Platform engineering team @ Cloudera, responsible for all engineering efforts related to Kubernetes, Apache Hadoop YARN, Resource Scheduling, and internal container cloud. In the open-source world, he's a member of Apache Software Foundation (ASF), PMC Chair of Apache Submarine project, He is also project management committee (PMC) members of Apache Hadoop, Apache YuniKorn (incubating). Before joining Cloudera, he leads High-performance-computing on Hadoop related work in EMC/Pivotal. Before that, he worked in Alibaba Cloud and participated in the development of a distributed machine learning platform (later became ODPS XLIB).

Tuesday 18:15 UTC
Apache Superset - A data visualization platform
Maxime Beauchemin

This talk explores Apache Superset through a live demo, and provides a high level understanding of what it offers as a product. We'll also share our journey thus far, and explore what it takes to grow an open source project, a community and a movement. We'll look at a retrospective of the design decisions, technology choices and engineering challenges that have shaped Superset. We'll also take a deep look into the current challenges the community is currently facing, and peak at what is ahead.

Max Beauchemin has worked at the leading edge of data and analytics his entire career, helping shape the discipline in influential roles at data-dependent companies like Facebook, Airbnb, Lyft and Yahoo!. A leader in the open-source community, Max is the creator of Apache Airflow, an open-source tool for orchestrating complex computational workflows and data processing pipelines and Apache Superset, a popular open-source data visualization, exploration and analytics platform. More recently he founded Preset, a company devoted to building upon Superset to offer next generation analytics as a service.

Wednesday 09:00 UTC
Apache Incubator, & How incubator communities are built ? (in Hindi language) [ALC Indore]
Aditya Sharma

This talk will be part of Track prepared by ALC Indore, and *language for the talk will be Hindi*. The talk will include the details on Apache Incubator, how to it works, and the important topic will be how to build a community around your incubating project. ## What is Apache Incubator? Apache Incubator is the gateway for open-source projects intended to become fully-fledged Apache Software Foundation projects. -- History -- Incubation process -- Current incubating projects -- How to contribute to the incubating project? ## How to build a community for your incubating project? The community is core for the success of any open source project, in this topic I will share some tips which can help you Incubating project to grow its community. After all, it is Community over Code.

Project Management Committee (PMC) member at Apache OFBiz and Apache Roller, Apache Local Community (ALC) Indore Chapter Lead

Wednesday 09:40 UTC
Apache APISIX: How to implement plugin orchestration in API GatewayEdit proposal
Ming Wen

Apache APISIX is a cloud-native API gateway that provides the same plugin mechanism as other gateways. However, in Apache APISIX, plugin orchestration is also provided that allows users to control the conditions and order for running plugins. Apache APISIX uses DAG(Directed Acyclic Graph) to implement this feature. In this share, we'll introduce Apache APISIX, and use a few examples to explain the advantages of plugin orchestration, and the specific implementation.

PPMC member of Apache APISIX CEO of ZhiLiu Technology Co., Ltd, China Speaker of ApcheCon EU 2016

Wednesday 10:20 UTC
ECharts: a storyteller of visualization evolution
Wenli Zhang

With the increasing demand for data visualization and a deeper understanding of theories, the role of data visualization tools has changed dramatically over the years. Previously, the main expectation was to help users understand abstract data through a static chart. Later, interactive tools were introduced to help users better understand the relationships between data. Today, another important aspect of our expectations is the ability to tell the stories. We expect visualization tools to help users explore and think about the story behind the data and get inspired or motivated to take further steps after reading the charts. In this sharing, we will introduce why and how Apache ECharts (incubating) has evolved to adapt to the changing needs and formed a modern visualization tool as you see today.

Wenli is a data visualization developer and PPMC of Apache ECharts (incubating). She has open-sourced many data visualization projects on GitHub (ID: Ovilia) and is enthusiastic about open-source community.

Wednesday 16:15 UTC
Hatching the Clutch - A Guide to the Apache Incubator
Dave Fisher

Podlings are said to be part of the Clutch. On a daily basis the status of podlings is evaluated from available information. This talk will describe the clutch evaluation process and how that feeds into the Incubator website. Additional topics: Whimsy and Podling information. Bootstrapping a Podling. Updating podling status.

Apache Software Foundation Member and Incubator Mentor.

Wednesday 16:55 UTC
Advice to Incubator Mentors
Justin Mclean

So you signed up to become a mentor for a project? Do you know what it entails or what is expected of you? Your project is relying on you to help guild it to graduation. In this talk, I'll be giving an overview of the ASF incubation process, the pitfalls to watch out for, and how projects become successful. I'll focus on everyday situations and challenges that podlings face and what's a good way to deal for mentors to deal with them. This talk is is for anyone who is thinking of being a mentor, is currently a mentor or for projects wanting a smooth path to graduation.

Justin Mclean has more than 25 years’ experience in developing web-based applications and is heavily involved in open source hardware and software. He runs his own consulting company Class Software and has spoken at numerous conferences in Australia and overseas. In his free time, he's active in several Apache Software Foundation projects, including the Apache Incubator, and is a mentor for a number of their projects. He's currently the chair of the Apache Incubator and on the ASF board. He also teaches at an online college and runs the IoT meetup in Sydney.

Wednesday 17:35 UTC
Daffodil - Kill the Data Format Problem
Michael Beckerle

Daffodil is an incubator project. Its goal is killing the data format problem by providing an implementation of DFDL (Data Format Description Language - an emerging standard from the Open Grid Forum) that we can all use and extend, and integrate into all our data-consuming frameworks and applications. This talk will use numerous compelling examples of Daffodil parsing, and unparsing (reconstructing) data in a variety of data formats - textual and binary, industry standard formats, and ad-hoc one-of-a-kind formats as well, and using both XML and JSON to make the data tangible and visible.

Apache commiter since 2017. Currently applying knowledge of scalable computing systems and data format issues at Owl Cyber Defense Solutions (formerly Tresys Technology). Since 2002, Co-chair DFDL Workgroup of Open Grid Forum - working towards a standard for data format description so we can all stop solving this problem over and over again. Former life as a CTO of a few small/startup companies. Likes to program in Scala and to do data archeology to figure out data from just the bits.

Wednesday 19:35 UTC
Teaclave: A Universal Secure Computing Platform
Mingshen Sun

Apache Teaclave (incubating) is a universal secure computing platform to make computation on privacy-sensitive data safe and secure. The platform adopts multiple security technologies to enable secure computing, in particular, Teaclave uses Intel SGX to serve the most security-sensitive tasks with hardware-based isolation, memory encryption and attestation. Teaclave is provided as a function-as-a-service platform and with many built-in functions, it supports a wide variety of tasks on sensitive data, such as privacy preserving machine learning, private set intersection, and cryptographic computation. More importantly, unlike traditional FaaS, Teaclave supports both general secure computing tasks and flexible single- and multi-party secure computation. Last but not least, Teaclave is written in Rust to prevent memory-safety issues. Teaclave entered the Apache Incubator in August 2019. In this talk, we would like to introduce this project to the whole community for the first time. We will discuss some motivation and background of the secure computing ecosystem. Then, we will present highlights of the Teaclave platform and its internal design, talk about the roadmap of incubating and current progress, and finally, introduce current status of Teaclave community.

Mingshen Sun works at Baidu and is a member of Apache Teaclave (incubating) PPMC (Podling Project Management Committee). He leads, maintains and actively contributes to several open source projects including Teaclave, MesaPy, Rust OP-TEE TrustZone SDK, etc. Please visit his homepage (https://mssun.me) for more information.