In this pages, I would like to list all new feature and break changes. Some features are in the status of , means Kylin team is going to implement this feature, and we will obey its priority. So if you have your suggestion/opinion on current priority, please let us know.


Release Plan

Release versionExpected DateCommentRelease Detail
4.0.0-alpha2020-09Release core features, including new build engin & query engine.s'c
4.0.0-beta2020-12 ~ 2021-01Implement other important features. 
4.0.0-gamma2021-04 Bug fix & Promotion
  • TODO
4.0.0 2021-07GA (Ready for production)
  • TODO
4.1.0Far future ...

Features

FeatureDescriptionCommentStatusComponentPriorityArrival(Expected)
Kafka Source(NRT)Ingest streaming data in batch wayIn design phase, did not have a conclusion of how to implement.


Kafka Source(Real-time OLAP)Ingest streaming data in stream/micro-batch wayIn design phase, did not have a conclusion of how to implement.

JDBC Source(Original version)Ingest data via JDBC contractIn design phase, did not have a conclusion of how to implement.

JDBC Source(Datasource SDK)Ingest data via JDBC contractIn design phase, did not have a conclusion of how to implement.

MapReduce Build EngineBuild pre-calculated cuboid data by Hadoop MapreduceThis feature maybe useless



Spark Build EngineBuild pre-calculated cuboid data by Apache SparkNew implementation provided

Flink Build EngineBuild pre-calculated cuboid data by Apache FlinkSupport in Kylin 3.1



HBase StorageUse HBase to store pre-calculated cuboid data.Discussion in mailing list



Parquet StorageUse Parquet to store pre-calculated cuboid data.Discussion in mailing list

Distributed Query Engine / SparderUse calcite&catayst(a.k.a. Spark SQL) to parse/analyse/excute a SQL query.New implementation provided

Measure - BitmapPrecise count distinct.N/A

Measure - HLLNon-precise count distinct but low cost.N/A

Measure - TopNTopN MeasureN/A

Measure - PercentilePercentileN/A

Query CacheCache query result in query's memory or external cache service.N/A

HBase MetastoreUse HBase as metastore.I guess it will be removed in the GA version. (xxyu)

RDBMS MetastoreUse RDBMS as metastore.Should as the first choice of metastore.

Cardinality ComputationCalculate cardinality of fact table and dimension table.Planning

Storage CleanupRemove useless data from storage or metastore.New implementation

CSV SourceBuild cube from user-side csv file.New implementation

SQL Standardto be updatedIn testing.

Global Dictionary(Hive)Use hive and MR to build global dictionaryNew global dictionary will replace this feature.



Global Dictionary(AppendTireDictionary)Tire dictionaryNew global dictionary will replace this feature.



Global Dictionary(Spark Bucket Dictionary)Use apache spark to build global dictionaryNew implementation

Cube Plannerto be updatedIn design phase, did not have a conclusion of how to implement.

System Cube and Dashboardto be updatedNot well tested, planning

Read write SeperatationThe query engine and build engine use different Hadoop cluster.New implementation provided

Pushdown Engineto be updatedNew pushdown engine will only support SparkSQL.

Shrunken Dictionaryto be updatedThis feature maybe useless



UHC dictionaryto be updatedThis feature maybe useless



Deploy on AWS EMR

Support deploy Kylin on EMR5.x, EMR 6.x .

Support Glue.

Planning

All-in-one containerProvided a quick-start container for learning purpose.How to learn Kylin in Docker

Hadoop3 support

Going to support Hadoop3 + Hive2 in 2020-Q4. Not sure when to suppoort Hive 3.

  • AWS EMR 6.X
  • CDH 6.X (latest 6.3.2)
Planning

Hive3 support
N/A

Spark3 supportSupport use Spark3 for build and query.N/A

Hybrid Model / Flexible cuboid buildAdd dimension or remove dimension without purge whole cube data.Planning

Multi-level partition segmentLooke like Hive's multi-level partition design.N/A

Use catalyst to replace calciteMake query analysis quicker and lighter.N/A



Link

Deprecated~Development Plan Kylin 4.0