Recommendations to others considering Apache Phoenix:
Try all SQL queries first that would be required. Eg. window function, ranks are not there. Review collected by and hosted on G2.com.
What problems is Apache Phoenix solving and how is that benefiting you?
Using Bulk Tools is pain, we use csv dump as of now.
CSV dump misses a lot of features (compare with pg copy command)
We have just 1 table which has a row combination of 2 cols. The table contains ~1TB of data.
Benefits -
Very cost effective, using existing resources hbase, hadoop.
Good for a direct usecase (but if someone wants a primary db better to try other dbs like cassandra, mongo etc based on use case)
Bulk Tools are something very good and I first used it with phoenix , however support for more formats like parquet, orc should be used built in bulk tools because data isn't store in csv/json/tsv on datalake) Review collected by and hosted on G2.com.