# Overview

![img.png](https://2758483936-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FcIRMOn4u9hdIi6agitCf%2Fuploads%2Fgit-blob-6d78d862255e03def022ffc06a7c9b07d7c200bb%2Flogo.png?alt=media)

## repository

[![License](https://img.shields.io/badge/license-MIT-green.svg)](https://opensource.org/licenses/MIT/)

[![Stargazers over time](https://starchart.cc/collabH/repository.svg)](https://shimin-huang.gitbook.io/doc/readme)

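### Introduction

* A personal knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
* [Online documentation](https://repository-1.gitbook.io/bigdata-growth/)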

### RoadMap

![roadMap](https://2758483936-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FcIRMOn4u9hdIi6agitCf%2Fuploads%2Fgit-blob-35eaa57bef8f17ec75a3f0a3f6788e5ddefd287b%2Froadmap.jpg?alt=media)

### Fundamentals

#### Data Structures

#### Distributed Systems Theory

* [Distributed architecture](https://shimin-huang.gitbook.io/doc/base/fen-bu-shi-li-lun/fen-bu-shi-jia-gou)

#### Computer Science Theory

* [LSM storage model](https://shimin-huang.gitbook.io/doc/base/ji-suan-ji-li-lun/lsm-cun-chu-mo-xing)

#### Scala

* [ScalaOverView](https://shimin-huang.gitbook.io/doc/base/scala/scalaoverview)

#### JVM

#### Java

**Concurrent Programming**

* [Introduction to concurrent programming](https://shimin-huang.gitbook.io/doc/base/java/bing-fa-bian-cheng/ren-shi-bing-fa-bian-cheng)
* [Concurrency utilities](https://shimin-huang.gitbook.io/doc/base/java/bing-fa-bian-cheng/bing-fa-gong-ju-lei-concurrent)

**JDK Source Code**

**todo**

### Algorithms

* [Algorithm problem solutions](https://shimin-huang.gitbook.io/doc/base/algorithm/suan-fa-ti-jie)

### BigData

#### cache

**Data Orchestration**

**alluxio**

* [Alluxio overview](https://shimin-huang.gitbook.io/doc/bigdata/cache/alluxio/alluxiooverview)
* [Alluxio deployment](https://shimin-huang.gitbook.io/doc/bigdata/cache/alluxio/alluxiodeployment)
* [Alluxio integration with compute engines](https://shimin-huang.gitbook.io/doc/bigdata/cache/alluxio/alluxiowithengine)

#### datalake

**hudi**

* [Hudi overview](https://shimin-huang.gitbook.io/doc/bigdata/datalake/hudi/hudioverview)
* [Hudi integration with Spark](https://shimin-huang.gitbook.io/doc/bigdata/datalake/hudi/hudiwithspark)
* [Hudi integration with Flink](https://shimin-huang.gitbook.io/doc/bigdata/datalake/hudi/hudiwithflink)
* [Hudi tuning in practice](https://shimin-huang.gitbook.io/doc/bigdata/datalake/hudi/hudi-tiao-you-shi-jian)
* [Hudi internals](https://shimin-huang.gitbook.io/doc/bigdata/datalake/hudi/hudi-yuan-li-fen-xi)
* [Hudi data lake in practice](https://shimin-huang.gitbook.io/doc/bigdata/datalake/hudi/hudi-shu-ju-hu-shi-jian)

**iceberg**

* [Iceberg overview](https://shimin-huang.gitbook.io/doc/bigdata/datalake/iceberg/icebergoverview)
* [Iceberg integration with Flink](https://shimin-huang.gitbook.io/doc/bigdata/datalake/iceberg/icebergwithflink)
* [Iceberg integration with Hive](https://shimin-huang.gitbook.io/doc/bigdata/datalake/iceberg/icebergwithhive)
* [Iceberg integration with Spark](https://shimin-huang.gitbook.io/doc/bigdata/datalake/iceberg/icebergwithspark)

#### kvstore

**Key-value stores such as HBase and RocksDB (an embedded KV store)**

**RocksDB**

* [RocksDB overview](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/rocksdb/rocksdboverview)
* [RocksDB configuration](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/rocksdb/rocksdb-pei-zhi)
* [RocksDB components](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/rocksdb/rocksdb-zu-jian-miao-shu)
* [RocksDB on Flink](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/rocksdb/rocksdb-on-flink)
* [RocksDB API](https://github.com/collabH/repository/blob/master/bigdata/kvstore/rocksdb/RocksDB%20API.xmind)

**HBase**

* [HBase overview](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/hbase/hbaseoverview)
* [HBaseShell](https://github.com/collabH/repository/blob/master/bigdata/kvstore/hbase/HBase%20Shell.xmind)
* [HBaseJavaAPI](https://github.com/collabH/repository/blob/master/bigdata/kvstore/hbase/HBase%20Java%20API.xmind)
* [HBase integration with MapReduce](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/hbase/hbase-zheng-he-di-san-fang-zu-jian)
* [HBase filters](https://shimin-huang.gitbook.io/doc/bigdata/kvstore/hbase/hbase-guo-lv-qi)

#### Hadoop

**Study notes on the broader Hadoop ecosystem, mainly reading notes and source code analysis for HDFS, MapReduce, and Yarn.**

**HDFS**

* [Hadoop quick start](https://github.com/collabH/repository/blob/master/bigdata/hadoop/Hadoop%E5%BF%AB%E9%80%9F%E5%BC%80%E5%A7%8B.xmind)
* [HDFSOverView](https://github.com/collabH/repository/blob/master/bigdata/hadoop/HDFS/HDFSOverView.xmind)
* [The broader Hadoop ecosystem](https://github.com/collabH/repository/blob/master/bigdata/hadoop/Hadoop%E5%B9%BF%E4%B9%89%E7%94%9F%E6%80%81%E7%B3%BB%E7%BB%9F.xmind)
* [Hadoop high-availability configuration](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/hadoop-gao-ke-yong-pei-zhi)
* [Hadoop Common analysis](https://github.com/collabH/repository/blob/master/bigdata/hadoop/HDFS/HadoopCommon%E5%8C%85%E5%88%86%E6%9E%90.pdf)
* [HDFS cluster management](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/hdfs/hdfs-ji-qun-guan-li)
* [HDFS Shell](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/hdfs/hdfs-shell-ming-ling)

**MapReduce**

* [MapReduce, a distributed processing framework](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/mapreduce/fen-bu-shi-chu-li-kuang-jia-mapreduce)
* [MapReduce overview](https://github.com/collabH/repository/blob/master/bigdata/hadoop/MapReduce/MapReduceOverView.xmind)
* [MapReduce tuning](https://github.com/collabH/repository/blob/master/bigdata/hadoop/MapReduce/MapReduce%E8%B0%83%E4%BC%98.xmind)
* [MapReduce data operations](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/mapreduce/mapreduce-shu-ju-cao-zuo)
* [MapReduce input and output in depth](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/mapreduce/mapreduce-shu-ru-shu-chu-pou-xi)
* [How MapReduce works](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/mapreduce/mapreduce-de-gong-zuo-yuan-li-pou-xi)

**Yarn**

* [Yarn quick start](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/yarn/yarn-kuai-su-ru-men)

**Production Configuration**

* [Hadoop high-availability configuration](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/hadoop-gao-ke-yong-pei-zhi)
* [Hadoop production configuration](https://shimin-huang.gitbook.io/doc/bigdata/hadoop/yarn/hadoop-xiang-guan-zu-jian-sheng-chan-ji-bie-pei-zhi)

#### Engine

**Compute engines, mainly Flink and Spark**

**Flink**

* Summaries of the Flink documentation, notes from reading the Flink source code, and records of new Flink features.

**Core**

* [FlinkOverView](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/core/flinkoverview)
* [Checkpoint mechanism](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/core/checkpoint-ji-zhi)
* [TableSQLOverview](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/core/tablesqloverview)
* [DataStream API](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/FlinkDataStream%20API.xmind)
* [ProcessFunction API](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/ProcessFunction%20API.xmind)
* [Data Source](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/Data%20Source.xmind)
* [Table API](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/TABLE%20API.xmind)
* [Flink SQL](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/FlinkSQL.xmind)
* [Flink Hive](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/Flink%20Hive.xmind)
* [Flink CEP](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/Flink%20Cep.xmind)
* [Flink Function](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/Flink%20Function.xmind)
* [DataSource API](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/core/Data%20Source.xmind)

**SourceCode**

* [Flink checkpoint source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flinkcheckpoint-yuan-ma-fen-xi)
* [Flink SQL source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flinksql-yuan-ma-jie-xi)
* [Flink kernel source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flink-nei-he-yuan-ma-fen-xi)
* [Flink network flow control and backpressure](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flink-wang-luo-liu-kong-ji-fan-ya)
* [TaskExecutor memory model in depth](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/taskexecutor-nei-cun-mo-xing-yuan-li-shen-ru)
* [Flink window implementation and usage](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flink-chuang-kou-shi-xian-ying-yong-yuan-li)
* [Flink runtime environment source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flink-yun-hang-huan-jing-yuan-ma-jie-xi)
* [Flink TimerService mechanism analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/flinktimerservice-ji-zhi-fen-xi)
* [StreamSource source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/sourcecode/streamsource-yuan-jie-xi)
* [Flink state management and checkpointing](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/sourcecode/Flink%E7%8A%B6%E6%80%81%E7%AE%A1%E7%90%86%E4%B8%8E%E6%A3%80%E6%9F%A5%E7%82%B9%E6%9C%BA%E5%88%B6.xmind)

**Book**

**Flink Kernel Principles and Implementation (《Flink内核原理与实现》)**

* [Chapters 1-3 reading notes](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/1-3%E7%AB%A0%E8%AF%BB%E4%B9%A6%E7%AC%94%E8%AE%B0.xmind)
* [Chapter 4: Time and windows](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/%E7%AC%AC4%E7%AB%A0%E6%97%B6%E9%97%B4%E4%B8%8E%E7%AA%97%E5%8F%A3.xmind)
* [Chapters 5-6 reading notes](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/5-6%E7%AB%A0%E7%B1%BB%E5%9E%8B%E5%BA%8F%E5%88%97%E5%8C%96%E5%92%8C%E5%86%85%E5%AD%98%E7%AE%A1%E7%90%86%E8%AF%BB%E4%B9%A6%E7%AC%94%E8%AE%B0.xmind)
* [Chapter 7: State internals](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/%E7%AC%AC7%E7%AB%A0%E7%8A%B6%E6%80%81%E5%8E%9F%E7%90%86.xmind)
* [Chapter 8: Job submission](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/%E7%AC%AC8%E7%AB%A0%E4%BD%9C%E4%B8%9A%E6%8F%90%E4%BA%A4.xmind)
* [Chapter 9: Resource management](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/%E7%AC%AC9%E7%AB%A0%E8%B5%84%E6%BA%90%E7%AE%A1%E7%90%86.xmind)
* [Chapter 10: Job scheduling](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/books/Flink%E5%86%85%E6%A0%B8%E5%8E%9F%E7%90%86%E4%B8%8E%E5%AE%9E%E7%8E%B0/%E7%AC%AC10%E7%AB%A0%E4%BD%9C%E4%B8%9A%E8%B0%83%E5%BA%A6.xmind)
* [Chapters 11-13: Task execution, data exchange, and more](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/books/flink-nei-he-yuan-li-yu-shi-xian/di-1113-zhang-task-zhi-hang-shu-ju-jiao-huan-deng)

**Feature**

* [Flink 1.12 new features](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/feature/flink1.12-xin-te-xing)
* [Flink 1.13 new features](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/feature/flink1.13-xin-te-xing)
* [Flink 1.14 new features](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/feature/flink1.14-xin-te-xing)

**Practice**

* [Flink pitfalls guide](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/practice/Flink%E8%B8%A9%E5%9D%91.xmind)
* [A Flink backpressure incident](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/practice/ji-lu-yi-ci-flink-fan-ya-wen-ti)
* [Flink SQL tuning in practice](https://github.com/collabH/repository/blob/master/bigdata/engine/flink/practice/Flink%20SQL%E8%B0%83%E4%BC%98.xmind)
* [Flink on K8s in practice](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/practice/flink-on-k8s)

**Connector**

* [Custom Table Connector](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/connector/zi-ding-yi-tableconnector)

**monitor**

* [Building a metrics monitoring system for Flink jobs](https://shimin-huang.gitbook.io/doc/bigdata/engine/flink/monitor/da-jian-flink-ren-wu-zhi-biao-jian-kong-xi-tong)

**Spark**

**Reading notes on Spark books, analysis of Spark core components, hands-on practice with Spark APIs, and production pitfalls.**

* [Spark basics](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/Spark%E5%9F%BA%E7%A1%80%E5%85%A5%E9%97%A8.xmind)
* [SparkOnDeploy](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/sparkondeploy)
* [Spark scheduling system](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/spark-tiao-du-xi-tong)
* [Spark compute engine and shuffle](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/spark-ji-suan-yin-qing-he-shuffle)
* [Spark storage system](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/spark-cun-chu-ti-xi)
* [Spark big data processing reading notes](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/Spark%E5%A4%A7%E6%95%B0%E6%8D%AE%E5%A4%84%E7%90%86%E8%AF%BB%E4%B9%A6%E7%AC%94%E8%AE%B0.xmind)

**Spark Core**

* [SparkCore](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/spark%20core/Spark%20Core.xmind)
* [SparkOperator](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/spark%20core/Spark%20Operator.xmind)
* [SparkConnector](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/spark%20core/Spark%20Connector.xmind)

**Spark SQL**

* [SparkSQLAPI](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/spark%20sql/Spark%20SQL%20API.xmind)
* [SparkSQL](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/spark%20sql/Spark%20SQL.xmind)
* [SparkSQL API](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/spark-sql/sparksql-api)
* [Spark SQL optimization analysis](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/spark-sql-1/sparksql-you-hua-fen-xi)

**Spark Practice**

* [Spark in production](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/practice/spark-sheng-chan-shi-jian)

**Spark Streaming**

* [SparkStreaming](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/spark%20streaming/Spark%20Steaming.xmind)
* [Spark Streaming integration with Flume](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/spark-streaming/sparkstreaming-zheng-he-flume)

**Source Code Analysis**

* [Spark source code, from basics to depth](https://shimin-huang.gitbook.io/doc/bigdata/engine/spark/cong-qian-dao-shen-pou-xi-spark-yuan-ma)
* [Source code analysis series](https://github.com/collabH/repository/blob/master/bigdata/engine/spark/%E6%BA%90%E7%A0%81%E5%88%86%E6%9E%90/README.md)

#### Collect

**Data collection frameworks, mainly binlog-based incremental capture and SQL snapshot approaches**

**Canal**

* [CanalOverView](https://shimin-huang.gitbook.io/doc/bigdata/collect/canal/canaloverview)

**Debezium**

* [DebeziumOverView](https://shimin-huang.gitbook.io/doc/bigdata/collect/debezium/debeziumoverview)
* [Debezium pitfalls](https://github.com/collabH/repository/blob/master/bigdata/collect/debezium/Debezium%E8%B8%A9%E5%9D%91.xmind)
* [Setting up Debezium monitoring](https://shimin-huang.gitbook.io/doc/bigdata/collect/debezium/debezium-jian-kong-xi-tong-da-jian)
* [Customizing Debezium](https://shimin-huang.gitbook.io/doc/bigdata/collect/debezium/debezium-shi-yong-gai-zao)

**Flume**

* [Flume quick start](https://shimin-huang.gitbook.io/doc/bigdata/collect/flume/flumeoverwrite)
* [Connecting Flume to Kafka](https://shimin-huang.gitbook.io/doc/bigdata/collect/flume/flume-dui-jie-kafka)

**Sqoop**

* [SqoopOverview](https://shimin-huang.gitbook.io/doc/bigdata/collect/sqoop/sqoopoverview)
* [Sqoop hands-on operations](https://shimin-huang.gitbook.io/doc/bigdata/collect/sqoop/sqoop-shi-zhan-cao-zuo)

#### MQ

**Message middleware, mainly Kafka and Pulsar, which are widely used in big data**

**Kafka**

* [Kafka overview](https://github.com/collabH/repository/blob/master/bigdata/mq/kafka/KafkaOverView.xmind)
* [Basic concepts](https://shimin-huang.gitbook.io/doc/bigdata/mq/kafka/ji-ben-gai-nian)
* [Kafka monitoring](https://shimin-huang.gitbook.io/doc/bigdata/mq/kafka/kafka-jian-kong)
* [Producer source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/mq/kafka/sheng-chan-zhe-yuan-ma-pou-xi)
* [Consumer source code analysis](https://shimin-huang.gitbook.io/doc/bigdata/mq/kafka/xiao-fei-zhe-yuan-ma-pou-xi)
* [kafkaShell](https://github.com/collabH/repository/blob/master/bigdata/mq/kafka/KafkaShell.xmind)
* [Kafka: The Definitive Guide reading notes](https://github.com/collabH/repository/blob/master/bigdata/mq/kafka/kafka%E6%9D%83%E5%A8%81%E6%8C%87%E5%8D%97/README.md)
* [Understanding Kafka in Depth reading notes](https://github.com/collabH/repository/blob/master/bigdata/mq/kafka/%E6%B7%B1%E5%85%A5%E7%90%86%E8%A7%A3Kafka/README.md)

**Pulsar**

* [Quick start](https://shimin-huang.gitbook.io/doc/bigdata/mq/pulsar/1.-kuai-su-ru-men)
* [Principles and practice](https://shimin-huang.gitbook.io/doc/bigdata/mq/pulsar/2.-yuan-li-yu-shi-jian)

#### Zookeeper

* [Zookeeper principles and configuration parameters](https://shimin-huang.gitbook.io/doc/bigdata/zookeeper/zookeeperoverview)
* [Zookeeper operations and deployment](https://shimin-huang.gitbook.io/doc/bigdata/zookeeper/zookeeper-cao-zuo-yu-bu-shu)

#### schedule

**Azkaban**

* [Azkaban in production](https://shimin-huang.gitbook.io/doc/bigdata/scheduler/azkaban-sheng-chan-shi-jian)

**DolphinScheduler**

* [DolphinScheduler quick start](https://shimin-huang.gitbook.io/doc/bigdata/scheduler/dolphinscheduler-kuai-su-kai-shi)

#### olap

**Covers OLAP engines, chiefly Kudu and Impala, along with production practice and paper notes.**

**Hive**

* [HiveOverview](https://shimin-huang.gitbook.io/doc/bigdata/olap/hive/hiveoverwrite)
* [Hive SQL](https://github.com/collabH/repository/blob/master/bigdata/olap/hive/Hive%20SQL.xmind)
* [Hive tuning guide](https://github.com/collabH/repository/blob/master/bigdata/olap/hive/Hive%E8%B0%83%E4%BC%98%E6%8C%87%E5%8D%97.xmind)
* [Hive pitfalls and solutions](https://github.com/collabH/repository/blob/master/bigdata/olap/hive/Hive%E8%B8%A9%E5%9D%91%E8%A7%A3%E5%86%B3%E6%96%B9%E6%A1%88.xmind)
* [Programming Hive reading notes](https://github.com/collabH/repository/blob/master/bigdata/olap/hive/hive%E7%BC%96%E7%A8%8B%E6%8C%87%E5%8D%97/README.md)
* [Hive Shell and Beeline](https://shimin-huang.gitbook.io/doc/bigdata/olap/hive/hive-shell-he-beeline-ming-ling)
* [Hive partitioned and bucketed tables](https://shimin-huang.gitbook.io/doc/bigdata/olap/hive/hive-fen-qu-biao-he-fen-tong-biao)

**Presto**

* [Presto overview](https://shimin-huang.gitbook.io/doc/bigdata/olap/presto/prestooverview)

**ClickHouse**

* [ClickHouse quick start](https://shimin-huang.gitbook.io/doc/bigdata/olap/clickhouse/clickhouseoverview)
* [ClickHouse table engines](https://github.com/collabH/repository/blob/master/bigdata/olap/clickhouse/ClickHouse%E8%A1%A8%E5%BC%95%E6%93%8E.xmind)

**Druid**

* [Druid overview](https://shimin-huang.gitbook.io/doc/bigdata/olap/druid/druidoverview)

**Kylin**

* [Kylin overview](https://shimin-huang.gitbook.io/doc/bigdata/olap/kylin/kylinoverwrite)

**Kudu**

* [KuduOverView](https://shimin-huang.gitbook.io/doc/bigdata/olap/kudu/kuduoverview)
* [Kudu table and schema design](https://shimin-huang.gitbook.io/doc/bigdata/olap/kudu/kuduschemadesgin)
* [KuduConfiguration](https://shimin-huang.gitbook.io/doc/bigdata/olap/kudu/kuduconfiguration)
* [Kudu internals](https://shimin-huang.gitbook.io/doc/bigdata/olap/kudu/kudu-yuan-li-fen-xi)
* [Kudu pitfalls](https://github.com/collabH/repository/blob/master/bigdata/olap/kudu/Kudu%E8%B8%A9%E5%9D%91.xmind)
* [Kudu storage structure architecture diagram](https://github.com/collabH/repository/blob/master/bigdata/olap/kudu/Kudu%E5%AD%98%E5%82%A8%E7%BB%93%E6%9E%84/README.md)
* [Kudu in production](https://shimin-huang.gitbook.io/doc/bigdata/olap/kudu/kudu-sheng-chan-shi-jian)

**paper**

* [Kudu paper reading notes](https://shimin-huang.gitbook.io/doc/bigdata/olap/kudu/paper/kudupaper-yue-du)

**Impala**

* [ImpalaOverView](https://shimin-huang.gitbook.io/doc/bigdata/olap/impala/impalaoverview)
* [ImpalaSQL](https://github.com/collabH/repository/blob/master/bigdata/olap/impala/Impala%20SQL.xmind)
* [Working with Kudu from Impala](https://shimin-huang.gitbook.io/doc/bigdata/olap/impala/shi-yong-impala-cha-xun-kudu-biao)
* [Impala in production](https://shimin-huang.gitbook.io/doc/bigdata/olap/impala/impala-sheng-chan-shi-jian)

#### graph

**Graph databases**

**nebula graph**

* [1. Introduction](https://shimin-huang.gitbook.io/doc/bigdata/graph/nebula-graph/1.-jian-jie)
* [2. Quick start](https://shimin-huang.gitbook.io/doc/bigdata/graph/nebula-graph-1/2.-kuai-su-ru-men)

#### tools

**Tooling, including compute platforms and SQL syntax trees**

**zeppelin**

* [zeppelin](https://github.com/collabH/repository/blob/master/bigdata/tools/zeppelin/Zeppelin.xmind)

**SQL Syntax Tree**

**calcite**

* [ApacheCalciteOverView](https://shimin-huang.gitbook.io/doc/bigdata/tools/sqltree/calcite/calciteoverview)

### Data Warehouse Construction

#### Theory

* [Data modeling](https://shimin-huang.gitbook.io/doc/datawarehouse/li-lun/datamodeler)
* [Data warehouse modeling](https://github.com/collabH/repository/blob/master/datawarehouse/%E7%90%86%E8%AE%BA/%E6%95%B0%E6%8D%AE%E4%BB%93%E5%BA%93%E5%BB%BA%E6%A8%A1.xmind)
* [Data warehouse](https://shimin-huang.gitbook.io/doc/datawarehouse/li-lun/shu-ju-cang-ku-shi-zhan)

#### Data Middle Platform Design

* [Data middle platform design](https://shimin-huang.gitbook.io/doc/datawarehouse/shu-ju-zhong-tai-mo-kuai-she-ji/shu-ju-zhong-tai-she-ji)
* [Design of thoth, an in-house metadata platform](https://shimin-huang.gitbook.io/doc/datawarehouse/shu-ju-zhong-tai-mo-kuai-she-ji/thoth-zi-yan-yuan-shu-ju-ping-tai-she-ji)

#### Solutions in Practice

* [Kudu cold data backup](https://shimin-huang.gitbook.io/doc/datawarehouse/fang-an-shi-jian/kudu-shu-ju-leng-bei-fang-an)
* [Building a real-time data warehouse on Flink](https://shimin-huang.gitbook.io/doc/datawarehouse/fang-an-shi-jian/ji-yu-flink-de-shi-shi-shu-cang-jian-she)

#### Reading Notes

* [Data middle platform reading notes](https://shimin-huang.gitbook.io/doc/datawarehouse/li-lun/shu-ju-zhong-tai-du-shu-bi-ji)

### devops

* [Shell commands](https://github.com/collabH/repository/blob/master/devops/Shell%E5%AD%A6%E4%B9%A0.xmind)
* [Linux commands](https://github.com/collabH/repository/blob/master/devops/Linux%E5%AD%A6%E4%B9%A0.xmind)
* [OpenShift basic commands](https://shimin-huang.gitbook.io/doc/datawarehouse/li-lun/devops/k8sopenshift-ke-hu-duan-ming-ling-shi-yong)

### maven

* [Creating a Maven archetype](https://shimin-huang.gitbook.io/doc/datawarehouse/li-lun/devops/maven/zhi-zuo-maven-gu-jia)
* [Maven commands](https://shimin-huang.gitbook.io/doc/datawarehouse/li-lun/devops/maven/maven-ming-ling)

### Service Monitoring

* [Prometheus](https://shimin-huang.gitbook.io/doc/servicemonitor/prometheus/prometheus-shi-zhan)

### mac

* [iterm2](https://shimin-huang.gitbook.io/doc/mac/iterm2)

## Contributing

* Contributions are welcome via [Gitter](https://gitter.im/collabH-repository/community)
* [Contributor guide](https://shimin-huang.gitbook.io/doc/contributing)

## Tech Sharing

![](https://2758483936-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FcIRMOn4u9hdIi6agitCf%2Fuploads%2Fgit-blob-465c750e5a72dff34adc662fe684153378aea497%2F%E5%85%AC%E4%BC%97%E5%8F%B7.jpeg?alt=media)
