Data Engeneering Pluimvee

schema beforeStar Schema

What?

Evap Proefbedrijf Pluimveehouderij is a company that collects data from tests on chickens. They want to structure this data and make it more accessible.

How?

In this project, I was tasked with creating a star schema and making sure that the data was loaded into the database. I also needed to make a script so that I could transform the data that was delivered to me from their database into a more readable and usable form(The STAR Model). Using Python and Pandas I wrote a script capable of cleaning the data and transforming it into this model.

Conclusion

I accomplished writing a script that opens Excel files and combines that information with information obtained from a database, into star schema. This configuration is then saved to a new database. During this project, I learned to apply my data science knowledge gained from classes in practice. I became proficient in using Pandas to manipulate datasets.

Technologies used

  • Dimensional Modeling
  • Python

Github