Welcome to this data science interview repository.
This repository is meant to be used with GitHub Codespaces. To start using this repository:
- Connect to your GitHub account.
- Click Use this template, then click Open in a codespace.
- When prompted, accept the installation of the Python extension for Visual Studio Code.
- When choosing the kernel, select `feedgy-ds-repo-template-NK18vZRh-py3.10`.
In this repository, you will find a dataset of 15-minute time series data from 3 solar plants. The columns of the dataset are as follows:
- `mpppt_id`: ID of the MPPT (a group of panels in the power plant)
- `date`: Date
- `power`: Power produced by the MPPT
- `voltage`: Voltage of the MPPT
- `current`: Current of the MPPT
- `poa_global`: Light received by the panels (plane-of-array irradiance)
- `temperature_ambiante`: Ambient temperature
- `pressure`: Ground pressure
- `wind_speed`: Ground wind speed
- `rainfall`: Amount of rain
- `snowfall`: Amount of snow
- `snow_depth`: Amount of snow on the ground
- `global_horizontal_irradiance`: Light received from above by a surface horizontal to the ground
- `clear_sky_global_horizontal_irradiance`: Light received from above by a surface horizontal to the ground if there were no clouds
- `farm`: Solar farm name
- `mppt_config`: Concatenation of the tilt and azimuth of an MPPT. Within the same farm, two MPPTs that share an `mppt_config` should produce the same power
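
The `mppt_config` property suggests one possible sanity check: within a farm, MPPTs that share a configuration can be compared against each other at each timestamp. The sketch below is only an illustration under assumptions. The sample values, the `"30_180"` style of configuration string, the helper name `flag_power_outliers`, and the 20% tolerance are all hypothetical and not taken from the dataset.

```python
import pandas as pd

# Tiny synthetic sample using the column names listed above.
# All values (including the "30_180" config string) are made up for illustration.
df = pd.DataFrame({
    "farm": ["A"] * 6,
    "mppt_config": ["30_180"] * 6,
    "mpppt_id": [1, 2, 3, 1, 2, 3],
    "date": pd.to_datetime(
        ["2023-06-01 12:00"] * 3 + ["2023-06-01 12:15"] * 3
    ),
    "power": [100.0, 102.0, 98.0, 110.0, 108.0, 60.0],
})


def flag_power_outliers(data, rel_tol=0.2):
    """Flag rows whose power deviates from the median of their
    (farm, mppt_config, date) group by more than rel_tol (hypothetical threshold)."""
    keys = ["farm", "mppt_config", "date"]
    group_median = data.groupby(keys)["power"].transform("median")
    # Avoid dividing by zero (e.g. at night): a zero median becomes NaN,
    # and NaN compares as False below, so such rows are never flagged.
    rel_dev = (data["power"] - group_median).abs() / group_median.where(group_median > 0)
    return data.assign(group_median=group_median, suspect=rel_dev > rel_tol)


result = flag_power_outliers(df)
print(result[["mpppt_id", "date", "power", "group_median", "suspect"]])
```

In this sample only the last row (MPPT 3 at 12:15) is flagged, because its power is far below that of the other two MPPTs with the same configuration. A median is used rather than a mean so that a single faulty MPPT does not shift the reference value too much.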
During the interview, you will be asked to access data, understand it, clean it, and build a regression model.
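
As a rough idea of what such a workflow can look like, here is a minimal sketch of a clean-then-fit baseline. It runs on generated data, not on the repository's dataset. The cleaning rules, the feature choice, the helper names `clean` and `fit_linear`, and the use of ordinary least squares are assumptions made for illustration; the interview may call for different choices.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the real dataset: values are generated, not measured.
rng = np.random.default_rng(0)
n = 200
raw = pd.DataFrame({
    "date": pd.date_range("2023-06-01", periods=n, freq="15min"),
    "poa_global": rng.uniform(50, 1000, n),
    "temperature_ambiante": rng.uniform(5, 35, n),
})
# Assumed toy relationship: power is proportional to irradiance, plus noise.
raw["power"] = 0.8 * raw["poa_global"] + rng.normal(0, 1.0, n)

raw.loc[3, "power"] = np.nan    # simulate a missing reading
raw.loc[7, "power"] = -50.0     # simulate a physically impossible reading
raw = pd.concat([raw, raw.iloc[[10]]], ignore_index=True)  # simulate a duplicated row


def clean(data):
    """Drop exact duplicates, rows with missing values, and negative power."""
    out = data.drop_duplicates()
    out = out.dropna(subset=["power", "poa_global", "temperature_ambiante"])
    out = out[out["power"] >= 0]
    return out.reset_index(drop=True)


def fit_linear(data, features, target="power"):
    """Ordinary least squares with an intercept, solved with numpy."""
    X = np.column_stack([data[features].to_numpy(dtype=float), np.ones(len(data))])
    y = data[target].to_numpy(dtype=float)
    coefs, *_ = np.linalg.lstsq(X, y, rcond=None)
    return dict(zip(features + ["intercept"], coefs))


cleaned = clean(raw)
model = fit_linear(cleaned, ["poa_global", "temperature_ambiante"])
print(len(raw), "rows before cleaning,", len(cleaned), "after")
print(model)
```

A linear model is used here only because it is the simplest regression baseline. On real data, the relationship between irradiance, temperature, and power may be better captured by other models.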