I would like to get some kind of histogram of data distribution in column in order to understand statistic problems.
我想在列中获得某种数据分布直方图,以便了解统计问题。
The first step is to get total information:
第一步是获取全部信息:
help stats tbName;
For sure it is not enough, I will just get number of unique values. So how can I further investigate data distribution?
当然这还不够,我会得到一些独特的价值。那么我该如何进一步调查数据分布呢?
1 个解决方案
#1
1
SHOW STATS VALUES COLUMN myCol ON myTable;
returns all details about histograms and biased values.
返回有关直方图和偏差值的所有详细信息。
For multi-column stats:
对于多列统计信息:
SHOW STATS VALUES COLUMN (myCol1, myCol2) ON myTable;
For all stats defined on a table:
对于在表上定义的所有统计信息:
SHOW STATS VALUES ON myTable;
#1
1
SHOW STATS VALUES COLUMN myCol ON myTable;
returns all details about histograms and biased values.
返回有关直方图和偏差值的所有详细信息。
For multi-column stats:
对于多列统计信息:
SHOW STATS VALUES COLUMN (myCol1, myCol2) ON myTable;
For all stats defined on a table:
对于在表上定义的所有统计信息:
SHOW STATS VALUES ON myTable;