Kylin 4.X Feature List

In this pages, I would like to list all new feature and break changes. Some features are in the status of IN PROGRESS, means Kylin team is going to implement this feature, and we will obey its priority. So if you have your suggestion/opinion on current priority, please let us know.

Release Plan

Release version	Expected Date	Comment	Release Detail
4.0.0-alpha	2020-09	Release core features, including new build engin & query engine.s'c	https://issues.apache.org/jira/projects/KYLIN/versions/12348093
4.0.0-beta	2020-12 ~ 2021-01	Implement other important features.	https://issues.apache.org/jira/projects/KYLIN/versions/12348723
4.0.0-gamma	2021-04	Bug fix & Promotion	TODO
4.0.0	2021-07	GA (Ready for production)	TODO
4.1.0	Far future ...

Features

Feature	Description	Comment	Status	Component	Priority	Arrival(Expected)
Kafka Source(NRT)	Ingest streaming data in batch way	In design phase, did not have a conclusion of how to implement.	IN PROGRESS	SOURCE	P1	4.1.0
Kafka Source(Real-time OLAP)	Ingest streaming data in stream/micro-batch way	In design phase, did not have a conclusion of how to implement.	IN PROGRESS	SOURCE	P1	4.1.0
JDBC Source(Original version)	Ingest data via JDBC contract	In design phase, did not have a conclusion of how to implement.	IN PROGRESS	SOURCE	P2	4.1.0
JDBC Source(Datasource SDK)	Ingest data via JDBC contract	In design phase, did not have a conclusion of how to implement.	IN PROGRESS	SOURCE	P2	4.1.0
MapReduce Build Engine	Build pre-calculated cuboid data by Hadoop Mapreduce	This feature maybe useless	DELETED	BUILD ENGINE
Spark Build Engine	Build pre-calculated cuboid data by Apache Spark	New implementation provided	READY	BUILD ENGINE	P0	4.0.0-ALPHA
Flink Build Engine	Build pre-calculated cuboid data by Apache Flink	Support in Kylin 3.1	DELETED	BUILD ENGINE
HBase Storage	Use HBase to store pre-calculated cuboid data.	Discussion in mailing list	DELETED	STORAGE ENGINE
Parquet Storage	Use Parquet to store pre-calculated cuboid data.	Discussion in mailing list	READY	STORAGE ENGINE	P0	4.0.0-ALPHA
Distributed Query Engine / Sparder	Use calcite&catayst(a.k.a. Spark SQL) to parse/analyse/excute a SQL query.	New implementation provided	READY	QUERY ENGINE	P0	4.0.0-ALPHA
Measure - Bitmap	Precise count distinct.	N/A	READY	MEASURE	P0	4.0.0-ALPHA
Measure - HLL	Non-precise count distinct but low cost.	N/A	READY	MEASURE	P0	4.0.0-ALPHA
Measure - TopN	TopN Measure	N/A	IN PROGRESS	MEASURE	P0	4.0.0-BETA
Measure - Percentile	Percentile	N/A	READY	MEASURE	P0	4.0.0-ALPHA
Query Cache	Cache query result in query's memory or external cache service.	N/A	READY	QUERY ENGINE	P0	BEFORE 4.0
HBase Metastore	Use HBase as metastore.	I guess it will be removed in the GA version. (xxyu)	DEPRECATED	METASTORE	P2	BEFORE 4.0
RDBMS Metastore	Use RDBMS as metastore.	Should as the first choice of metastore.	READY	METASTORE	P0	BEFORE 4.0
Cardinality Computation	Calculate cardinality of fact table and dimension table.	Planning	IN PROGRESS	TOOL	P1	4.0.0-BETA
Storage Cleanup	Remove useless data from storage or metastore.	New implementation	READY	TOOL	P0	4.0.0-ALPHA
CSV Source	Build cube from user-side csv file.	New implementation	READY	SOURCE	P1	4.0.0-ALPHA
SQL Standard	to be updated	In testing.	IN PROGRESS	QUERY ENGINE	P0	4.0.0-BETA
Global Dictionary(Hive)	Use hive and MR to build global dictionary	New global dictionary will replace this feature.	DELETED	BUILD ENGINE
Global Dictionary(AppendTireDictionary)	Tire dictionary	New global dictionary will replace this feature.	DELETED	BUILD ENGINE
Global Dictionary(Spark Bucket Dictionary)	Use apache spark to build global dictionary	New implementation	READY	BUILD ENGINE	P0	4.0.0-ALPHA
Cube Planner	to be updated	In design phase, did not have a conclusion of how to implement.	IN PROGRESS	ADVANCED	P0	4.0.0-BETA
System Cube and Dashboard	to be updated	Not well tested, planning	IN PROGRESS	ADVANCED	P0	4.0.0-BETA
Read write Seperatation	The query engine and build engine use different Hadoop cluster.	New implementation provided	READY	ADVANCED	P0	4.0.0-ALPHA
Pushdown Engine	to be updated	New pushdown engine will only support SparkSQL.	READY	QUERY ENGINE	P0	4.0.0-ALPHA
Shrunken Dictionary	to be updated	This feature maybe useless	DELETED	ADVANCED
UHC dictionary	to be updated	This feature maybe useless	DELETED	ADVANCED
Deploy on AWS EMR	Support deploy Kylin on EMR5.x, EMR 6.x . Support Glue.	Planning	IN PROGRESS	ENV	P0	4.0.0-BETA
All-in-one container	Provided a quick-start container for learning purpose.	How to learn Kylin in Docker	READY	ENV	P0	4.0.0-ALPHA
Hadoop3 support	Going to support Hadoop3 + Hive2 in 2020-Q4. Not sure when to suppoort Hive 3. AWS EMR 6.X CDH 6.X (latest 6.3.2)	Planning	IN PROGRESS	ENV	P0	4.0.0-BETA
Hive3 support		N/A	IN PROGRESS	ENV	P2	FUTURE
Spark3 support	Support use Spark3 for build and query.	N/A	IN PROGRESS	ENV	P2	4.1.0
Hybrid Model / Flexible cuboid build	Add dimension or remove dimension without purge whole cube data.	Planning	IN PROGRESS	BUILD ENGINE	P1	4.1.0
Multi-level partition segment	Looke like Hive's multi-level partition design.	N/A	IN PROGRESS	BUILD ENGINE	P1	4.1.0
Use catalyst to replace calcite	Make query analysis quicker and lighter.	N/A	IN PROGRESS	QUERY ENGINE	P1	4.1.0

Link

- Deprecated~Development Plan Kylin 4.0

Space shortcuts

Page tree

Release Plan

Features

Link