First, look at the following Elasticsearch query:
{
  "size": 0,
  "aggs": {
    "all_articleId": {
      "terms": {
        "field": "articleId"
      }
    }
  }
}
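The result contained only 10 buckets.
The index holds more than 20,000 documents, and bucketing them by articleId should certainly produce more than 10 buckets, so this is clearly not the result I wanted. (A terms aggregation returns only the top 10 buckets by default.) After checking the official API documentation, I found the following passage: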
Edit: Updating the answer based on @PhaedrusTheGreek's comment. Setting size:0 is deprecated from 2.x onwards, due to memory issues inflicted on your cluster with high-cardinality field values. You can read more about it in the GitHub issue here. It is recommended to explicitly set a reasonable value for size, a number between 1 and 2147483647.
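In other words: setting size:0 inside an aggregation has been deprecated since 2.x, and it is recommended to set size explicitly to a reasonable value, a number between 1 and 2147483647. I then changed the query to the following: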
{
  "size": 0,
  "aggs": {
    "all_articleId": {
      "terms": {
        "field": "articleId",
        "size": 20000
      }
    }
  }
}
The query now returns more than 200 buckets, which is exactly the result I wanted.
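For anyone building this query from code, here is a minimal Python sketch of the same idea. The helper names build_terms_query and summarize_buckets are hypothetical, not part of any Elasticsearch client, and the sample response is hand-written for illustration rather than real cluster output. The sketch assumes the response carries the usual sum_other_doc_count field of a terms aggregation, which counts documents that fell outside the returned buckets. A non-zero value suggests the bucket list was cut off by size.

```python
import json

MAX_TERMS_SIZE = 2147483647  # upper bound mentioned in the quoted passage


def build_terms_query(field, size, agg_name="all_articleId"):
    """Build a request body asking for up to `size` term buckets (hypothetical helper)."""
    if not 1 <= size <= MAX_TERMS_SIZE:
        raise ValueError(f"size must be between 1 and {MAX_TERMS_SIZE}")
    return {
        "size": 0,  # top-level size: return no hits, only aggregation results
        "aggs": {agg_name: {"terms": {"field": field, "size": size}}},
    }


def summarize_buckets(response, agg_name="all_articleId"):
    """Return (bucket_count, truncated) for a terms aggregation response."""
    agg = response["aggregations"][agg_name]
    # A non-zero sum_other_doc_count means some documents are not covered
    # by the returned buckets, i.e. the bucket list was likely cut off.
    truncated = agg.get("sum_other_doc_count", 0) > 0
    return len(agg["buckets"]), truncated


# Hand-written sample response for illustration only (not real output).
SAMPLE_RESPONSE = {
    "aggregations": {
        "all_articleId": {
            "doc_count_error_upper_bound": 0,
            "sum_other_doc_count": 35,
            "buckets": [
                {"key": 101, "doc_count": 12},
                {"key": 102, "doc_count": 9},
            ],
        }
    }
}

if __name__ == "__main__":
    print(json.dumps(build_terms_query("articleId", 20000), indent=2))
    print(summarize_buckets(SAMPLE_RESPONSE))
```

The body returned by build_terms_query can be sent with whatever HTTP or Elasticsearch client is already in use. Checking the truncation flag is a cheap way to notice that size is still too small, instead of counting buckets by eye.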