I'm looking at solutions to store a massive quantity of information consuming the less possible disk space.
我正在寻找能够存储大量信息的解决方案,从而消耗更少的磁盘空间。
The information structure is very simple and the queries will also be very simple. I've looked at solutions like Apache Cassandra and relations databases but couldn't find a comparison where disk usage is mentioned.
信息结构非常简单,查询也很简单。我已经看过像Apache Cassandra和关系数据库这样的解决方案,但是找不到提到磁盘使用情况的比较。
Any ideas on this would be great.
对此的任何想法都会很棒。
4 个解决方案
#1
2
Take a look at Oracle Berkeley DB - very simple robust database (key/value):
看看Oracle Berkeley DB - 非常简单的健壮数据库(键/值):
"Berkeley DB enables the development of custom data management solutions, without the overhead traditionally associated with such custom projects. Berkeley DB provides a collection of well-proven building-block technologies that can be configured to address any application need from the handheld device to the datacenter, from a local storage solution to a world-wide distributed one, from kilobytes to petabytes."
“Berkeley DB支持开发定制数据管理解决方案,无需传统上与此类定制项目相关的开销.Berkeley DB提供了一系列经过验证的构建块技术,可以配置这些技术以满足从手持设备到手持设备的任何应用需求。数据中心,从本地存储解决方案到世界范围内的分布式解决方案,从千字节到千兆字节。“
#2
3
Speaking about Apache Cassandra - it's just a disk space hog. 200 MB of logs resulted in 1.2 GB files produced by Cassandra - and the keyspace was just 4 columns with 200 length strings.
谈到Apache Cassandra - 它只是一个磁盘空间的猪。 200 MB的日志产生了Cassandra生成的1.2 GB文件 - 键空间只有4列,200个长度字符串。
#4
0
Newest version of Microsoft's SQL Server (2008) supports several levels of compression (row compression and page compression, in addition to backup compression). Might be worth investigating.
最新版本的Microsoft SQL Server(2008)支持多种级别的压缩(行压缩和页面压缩,以及备份压缩)。可能值得调查。
Some relevant resources:
一些相关资源:
- Linchi Shea shows that compression can sometimes improve performance
- Linchi Shea表明压缩有时可以提高性能
- Official MS Best Pracices doc for SQL 2008 compression
- SQL 2008压缩的官方MS Best Pracices doc
#1
2
Take a look at Oracle Berkeley DB - very simple robust database (key/value):
看看Oracle Berkeley DB - 非常简单的健壮数据库(键/值):
"Berkeley DB enables the development of custom data management solutions, without the overhead traditionally associated with such custom projects. Berkeley DB provides a collection of well-proven building-block technologies that can be configured to address any application need from the handheld device to the datacenter, from a local storage solution to a world-wide distributed one, from kilobytes to petabytes."
“Berkeley DB支持开发定制数据管理解决方案,无需传统上与此类定制项目相关的开销.Berkeley DB提供了一系列经过验证的构建块技术,可以配置这些技术以满足从手持设备到手持设备的任何应用需求。数据中心,从本地存储解决方案到世界范围内的分布式解决方案,从千字节到千兆字节。“
#2
3
Speaking about Apache Cassandra - it's just a disk space hog. 200 MB of logs resulted in 1.2 GB files produced by Cassandra - and the keyspace was just 4 columns with 200 length strings.
谈到Apache Cassandra - 它只是一个磁盘空间的猪。 200 MB的日志产生了Cassandra生成的1.2 GB文件 - 键空间只有4列,200个长度字符串。
#3
#4
0
Newest version of Microsoft's SQL Server (2008) supports several levels of compression (row compression and page compression, in addition to backup compression). Might be worth investigating.
最新版本的Microsoft SQL Server(2008)支持多种级别的压缩(行压缩和页面压缩,以及备份压缩)。可能值得调查。
Some relevant resources:
一些相关资源:
- Linchi Shea shows that compression can sometimes improve performance
- Linchi Shea表明压缩有时可以提高性能
- Official MS Best Pracices doc for SQL 2008 compression
- SQL 2008压缩的官方MS Best Pracices doc