1. TeraSortFor the TeraSort testing, we used a 1TB dataset fed by TeraGen. We discovered that NetApp Cloud Volumes Service performed 22% faster than competing object storage in the cloud.
Faster storage performance can result in less compute time (and therefore less cost) waiting for I/O to respond. More importantly, it means that you can get your results faster.
2. LLAPWe also tested the same configuration using the LLAP benchmark and found that, on average, Cloud Volumes Service was 16% faster than the leading object storage. Peak throughput was 4323.55 MBps, which is a pretty solid number for this kind of workload in the cloud.
We will continue testing with Hortonworks, and we intend to offer additional guidance on recommended configurations for optimum performance and cost.
Conclusion
The Hortonworks HDP 3.0 certification for NetApp Cloud Volumes Service using HDFS over NFS means that you can deploy this solution for your big data projects with confidence. Even with limited performance testing, this solution demonstrates excellent performance for your data analytics projects.
Environment Setups Details