An efficient approach to machine learning based text classification through distributed computing