.NET的BLOB分布式存储?[英] Distributed Storage of BLOBs for .NET?

本文是小编为大家收集整理的关于.NET的BLOB分布式存储?的处理/解决方法,可以参考本文帮助大家快速定位并解决问题,中文翻译不准确的可切换到English标签页查看源文。

问题描述

我正在寻找一个经过良好测试的库+服务器来存储持久的分布式哈希表.

我犹豫不决地使用基于SQL的解决方案,因为数据是高度文档的,由数百万个〜64KB的斑点组成,只有一个索引(由该索引计算),并且需要能够分布长期缩放前景.

由于费用和带宽的注意事项,诸如S3之类的外部解决方案是选择.

诸如couchdb或Project voldemort之类的东西是理想的 - 但是,两者都缺乏.NET绑定(可以是Java的PV绑定 - 但是有"问题".).密钥和值都是字节数组(键是16个字节,值最高为2048kb平均64kb)

我到目前为止已经搜索了某种发电机,和弦和类似的.NET端口 - 但是,大多数结果似乎是纯粹的内存缓存,并且缺乏任何形式的持久性或复制形式.

有人有任何想法或建议吗?

推荐答案

看一下Ayende的 rhino dht .可能与您想要的内容更加内联.可以获取源在这里..

其他推荐答案

dryadlinq " http://code.google.com/p/hadoopdotnet/" rel =" nofollow noreferrer"> hadoop.net 可能会有所帮助.

hadoop.net是hadoop的dotnet版本.可以找到有关Hadoop的更多信息,可以找到在这里

其他推荐答案

我实际上认为您应该考虑SQL Server2008.将数据存储在带有Varbinary(Max)列的表中,以及包含该列的哈希的列.如您所建议的那样,索引哈希.

然后,您将能够使用产品的各种分销功能.

本文地址:https://www.itbaoku.cn/post/597592.html

问题描述

I am looking for a reasonably well tested library+server to store a persistent distributed hash table.

I am hesistant to use SQL-based solutions as the data is highly document oriented, consisting of millions of ~64KB blobs with only a single index (computed by hash of said BLOB) - and needs to be able to be distributed for long term scaling prospects.

Due to expense and bandwidth considerations, external solutions such as S3 are not an option.

Something like CouchDB or Project Voldemort would be ideal - however there is a noticable lack of .NET bindings for both (PV can be IKVMC'd from Java - however has "issues".). Both key and value are byte arrays (key is 16 byte, the value is up to 2048KB averaging 64KB)

I have searched so far for some kind of .NET port of Dynamo, Chord and similar - however the majority of results appear to be purely in-memory caches and lack any form of persistence or replication.

Anyone got any ideas or suggestions?

推荐答案

Take a look at Ayende's Rhino DHT. Might be more inline with what you are looking for. The source can be acquired here.

其他推荐答案

DryadLINQ or Hadoop.Net may help.

Hadoop.Net is dotnet version of Hadoop. More about Hadoop can be found here

其他推荐答案

I actually think you should consider SQL Server 2008. Store the data in a table with a varbinary(max) column, along with a column that contains the hash of that column. Index the hash, as you suggested.

You'll then be able to use the various distribution features of the product.