GlusterFS with erasure coding over WAN
I am thinking of binding ten cheap storage VPSes at different providers into one storage cluster using erasure coding, say 10-5 (10 nodes in total, any 5 of which are enough to recover the data).
Ultimately I would move to 15-10 with 1 TB per node, which would effectively store 10 TB of data with only 5 TB of overhead, withstand any five VPSes going down at the same time, and make heterogeneous growth of the cluster easy.
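To make the capacity arithmetic explicit, here is a minimal sketch in Python. It assumes every node contributes the same amount of space and ignores filesystem and metadata overhead; the class name `ErasureLayout` and its fields are made up for illustration and are not part of GlusterFS or Tahoe-LAFS.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErasureLayout:
    """Hypothetical helper: an N-K erasure-coded layout over equal-size nodes."""
    total: int      # N: number of nodes (one fragment per node)
    needed: int     # K: fragments required to reconstruct the data
    node_tb: float  # capacity contributed by each node, in TB (assumed equal)

    def __post_init__(self):
        if not 0 < self.needed <= self.total:
            raise ValueError("need 0 < needed <= total")

    @property
    def raw_tb(self) -> float:
        # Total disk space rented across all nodes.
        return self.total * self.node_tb

    @property
    def usable_tb(self) -> float:
        # Only K/N of the raw space holds unique data.
        return self.needed * self.node_tb

    @property
    def overhead_tb(self) -> float:
        # Space spent on redundancy.
        return self.raw_tb - self.usable_tb

    @property
    def tolerated_failures(self) -> int:
        # Any N-K nodes can be lost without losing data.
        return self.total - self.needed

    @property
    def expansion(self) -> float:
        # Bytes stored (and sent over the WAN on write) per byte of user data.
        return self.total / self.needed


if __name__ == "__main__":
    for layout in (ErasureLayout(10, 5, 1.0), ErasureLayout(15, 10, 1.0)):
        print(
            f"{layout.total}-{layout.needed}: "
            f"usable={layout.usable_tb} TB, overhead={layout.overhead_tb} TB, "
            f"tolerates={layout.tolerated_failures} failures, "
            f"expansion={layout.expansion}x"
        )
```

The expansion factor is worth keeping in mind for a WAN setup: with 10-5, every byte written turns into two bytes stored across the nodes, while 15-10 brings that down to 1.5.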
I already know that performance is going to be bad, but can anyone guess how bad?
I am testing Tahoe-LAFS in a 10-5 setup and I get write speeds of 800 kB/s to 1.2 MB/s to the cluster.
I haven't tested reads yet, nor have I started looking for bottlenecks, but as a rule of thumb, is GlusterFS going to be better or worse?