Course info and materials.
Practicals: Ondřej Týbl, Filip Bočinec ([surname]@karlin.mff.cuni.cz)
Time: Monday 12:20
Room: K4
Date | Topic | Homework Assignment | Homework Deadline1 | Lecturer |
---|---|---|---|---|
29.09.2025 | introduction | click | 15.10.2025 | Filip |
13.10.2025 | numpy | [TODO] | 20.10.2025 | Filip |
27.10.2025 | pandas | [TODO] | 03.11.2025 | Ondřej |
10.11.2025 | sql | [TODO] | 17.11.2025 | Ondřej |
24.11.2025 | matplotlib | [TODO] | 01.12.2025 | Ondřej |
08.12.2025 | scikit-learn | [TODO] | 15.12.2025 | Filip |
05.01.2026 | Object-oriented programming | [TODO] | 12.01.2026 | Ondřej |
Several homework assignments will be given. You are required to submit a solution for each assignment by the respective deadline. Your code will be evaluated based on readability, efficiency, correctness, and whether it runs successfully.
After each deadline, we check your GitHub repositories and evaluate your solutions as passed
, revision
or failed
. You need all the solutions to be marked as passed
by the end of January 2026
to obtain the course credits. If marked as revision
, you can resubmit your improved solution to obtain passed
(no submissions after the end of January 2026
).
[TODO]
In our course, we will learn how to use a whole bunch of tools and technologies that form the foundation of data science.
Category | Solution | Description |
---|---|---|
Programming Language | Python | The language used to write and execute code. |
Integrated Development Environment (IDE) | PyCharm | Editor for writing, debugging, and managing projects (+ many more). |
Virtual Environment | venv | An isolated environment to manage project-specific package dependencies. |
Script / Notebook | Jupyter Notebook | An interactive file format/environment for running and documenting code. |
Version Control System (VCS) | Git | Tracks code changes and manages project history. |
VCS Hosting Platform | GitHub | Cloud-based platform for hosting and collaborating on Git repositories. |
Below, a detailed instructions on how to set everything up is provided.
- Download the installer here.
- Install PyCharm using default options.
-
Windows
Download Git for Windows (use the 64-bit Git for Windows Setup).
Install with the recommended (default) settings.
Verify installation by opening Command Prompt and typing:git --version
-
macOS
Install Xcode Command Line Tools (which include Git) by running:xcode-select --install
Verify installation by typing:
git --version
- Go to GitHub and create an account.
- Fork the course repository by clicking Fork here: python-for-data-science.
(We strongly recommend keeping the default repository name.) - A fork enables you to commit your own changes in a separate copy of the repository.
- (Windows) Select Clone Repository.
- (Mac) Select Get from VCS.
- Enter the URL of your forked repository:
https://github.com/[your-username]/python-for-data-science
- Select Clone.
- (Mac) In the bottom-right corner of PyCharm, click on
<No Interpreter>
, select Add New Interpreter → Add Local Interpreter. Create a new virtual environment with Python 3.11 (if not present, PyCharm will download it). Open the Terminal in PyCharm (icon in the bottom-left corner). Make sure your command line starts with(.venv)
→ this confirms the virtual environment is active. Install all required packages:pip install -r requirements.txt
- (Windows) After project is created, Creating Virtual Environment window should pop up (if not, follow Mac instructions). As base interpreter, choose Python 3.11 and click OK
- A blue progress bar will appear at the bottom of PyCharm when processes such as package installation are running. Please wait until it completes.
- If you need additional packages later, install them via:
pip install [package]
- Open
assignment1.ipynb
in PyCharm - Click Run to verify everything works.
- Add a new cell that prints your GitHub username.
- Submit your solution with Git → Commit → Commit and Push.
- Enter a commit message.
- Click Commit and Push.
- Now GitHub wants to authenticate you. Select authentitacion via Token, now a new window where you should enter your token will appear. To create this token, go to GitHub website select your account icon in the top-right –> Settings –> Developer Settings –> Personal access tokens –> Tokens (classic) –> Generate new token –> add some description, validity deadline, select repo and Generate token. The token will appear and paste it in your PyCharm.
- Verify your changes at:
https://github.com/[your-username]/python-for-data-science
- Tell us your name and GitHub username.
All future homework solutions will be submitted the same way.
You are ready to start working on the course!
- Problem: When creating virtual environment, Pycharm does not provide any python versions. Solution: Completely erase the project including the folder in PycharmProjects and create it from scratch. If does not help, install desired python version yourself.
Footnotes
-
All homework deadlines are due at 23:59 on the specified date. ↩