Data preprocessing in Google Colab