Is there an accepted way of storing and accessing sparse numerical data (such as a search engine's inverted index / term by document matrix)? An RDBMS seems inappropriate for this kind of data, but it would be good to have it stored in some kind of database (saved to disk, running as a server, etc). Is there an accepted solution for this kind of problem (such as an existing database capable of supporting this kind of model)? Anyone know how Google stores and accesses their indexes so fast?
是否存在一种可接受的存储和访问稀疏数值数据的方式(例如搜索引擎的反向索引/术语按文档矩阵)? RDBMS似乎不适合这种数据,但最好将它存储在某种数据库中(保存到磁盘,作为服务器运行等)。是否存在针对此类问题的可接受解决方案(例如,能够支持此类模型的现有数据库)?有人知道Google如何快速存储和访问他们的索引吗?
1 个解决方案
#1
Have a look here for more info on Google and links to more info.
在这里查看有关Google的更多信息以及更多信息的链接。