Job at LIRIS

During my internship in LIRIS research laboratory, I was assigned to a project that was part of a thesis. The goal was to evaluate a dataset prior to machine learning training in order to identify any incorrect data. To achieve this, I created Python algorithms using blocking methods to analyze the dataset and return the conflicting records.

I was responsible for developing and implementing the algorithms, testing them, and analyzing the results. I also had to work with large datasets and manage the data effectively, as well as using various libraries like pandas and numpy.

This experience allowed me to develop my skills in data analysis, data preprocessing, and machine learning. It also helped me to gain a deeper understanding of the importance of data quality in machine learning and how to ensure the data is correct before training a model.

Related