Assume I've millions and millions of small files now. And this would grow over period. What would be the best I could choose between storing small files in HBase vs HDFS.
Storing in hdfs in different flavours like
1) Storing small files within a directory and retrieving 2) Storing small files as sequence files 3) Choosing to store HAR files - I couldn't understand this well (https://blog.cloudera.com/blog/2009/02/the-small-files-problem/) I suggested this after referring this article.