• Data preparation as a service based on Apache Spark 

      Mahasivam, Nivethika; Nikolov, Nikolay; Sukhobok, Dina; Roman, Dumitru (Journal article; Peer reviewed, 2017)
      Data preparation is the process of collecting, cleaning and consolidating raw datasets into cleaned data of a certain quality. It is an important aspect of almost every data analysis process, and yet it remains tedious and ...