I need to store around 100 millions of records on the database. Around 60-70% of them will be deleted daily and same amount of records are inserted daily. I feel a document database like Hbase, Big Table would fit in this. There are many other data stores like Cassandra, MongoDb, etc. Which data store would be useful for this kind of problem as there will be huge amount of reads/writes(order of 10's of millions) daily.
我需要在数据库中存储大约1亿条记录。其中约60-70%将每天删除,每天插入相同数量的记录。我觉得像Hbase这样的文档数据库,Big Table适合这个。还有许多其他数据存储,如Cassandra,MongoDb等。哪种数据存储对这类问题很有用,因为每天会有大量的读/写(数百万的订单)。
1 个解决方案
#1
3
Based on the characteristics you've mentioned (JSON Documents, accesses by key, 100 million records, balanced read/write) I'd say CouchDB or Membase are good candidates (here's a quick comparison)
根据您提到的特性(JSON文档,按键访问,1亿条记录,平衡读/写)我会说CouchDB或Membase是很好的候选者(这里是一个快速比较)
Both HBase and Cassandra can probably also work but for HBase you'd need to install a lot of components (Hadoop, ZooKeeper etc) that you won't really use d only use and Cassandra is better when you have more writes than read (at least the last time I used it).
HBase和Cassandra都可以工作但是对于HBase你需要安装很多你不会真正使用的组件(Hadoop,ZooKeeper等),当你有更多的写入而不是读取时,Cassandra会更好(在至少我最后一次使用它)。
Big Table, is unfortunately internal to google : )
不幸的是,Big Table是谷歌内部的:)
#1
3
Based on the characteristics you've mentioned (JSON Documents, accesses by key, 100 million records, balanced read/write) I'd say CouchDB or Membase are good candidates (here's a quick comparison)
根据您提到的特性(JSON文档,按键访问,1亿条记录,平衡读/写)我会说CouchDB或Membase是很好的候选者(这里是一个快速比较)
Both HBase and Cassandra can probably also work but for HBase you'd need to install a lot of components (Hadoop, ZooKeeper etc) that you won't really use d only use and Cassandra is better when you have more writes than read (at least the last time I used it).
HBase和Cassandra都可以工作但是对于HBase你需要安装很多你不会真正使用的组件(Hadoop,ZooKeeper等),当你有更多的写入而不是读取时,Cassandra会更好(在至少我最后一次使用它)。
Big Table, is unfortunately internal to google : )
不幸的是,Big Table是谷歌内部的:)