Towards a systematic study of big data performance and benchmarking