0%

ClickHouse-Trial

阅读更多

1 Tips

1.1 如何创建分布式表

Distributed Table Engine

首先,在每个节点创建本地表

1
CREATE TABLE IF NOT EXISTS t0_local (fn_int UInt32,fn_int_null UInt32, par_int_low UInt32, par_int_high UInt32, par_str_low LowCardinality(String), par_str_high String) ENGINE = MergeTree() PRIMARY KEY (fn_int)

然后,在任意一个节点,创建分布式表

1
CREATE TABLE IF NOT EXISTS t0 (fn_int UInt32,fn_int_null UInt32, par_int_low UInt32, par_int_high UInt32, par_str_low LowCardinality(String), par_str_high String) ENGINE = Distributed('perftest_3shards_1replicas', 'analytic_1M', 't0_local', fn_int)

这流程,属实一言难尽😢

1.2 如何查看配置

1
select * from system.settings;

1.3 如何删除数据

TRUNCATE Statement

1.4 clickhouse-client

  • -m:多行模式,以;作为SQL结束标识符,默认以换行作为结束标识符
1
clickhouse-client -h 172.26.194.221 --password sr -m

1.5 How to output table to csv

1
SELECT * FROM lineitem limit 5 INTO OUTFILE '/path/test.csv' FORMAT CSV;

1.6 How to output hdfs file to csv

Follow doc Install ClickHouse, start a clickhouse local, and execute sql like

1
2
INSERT INTO FUNCTION file('ssb100.csv', `CSV`)
SELECT * FROM hdfs('hdfs://xxx:12000/user/hive/warehouse/lineorder_flat/*', 'Parquet');