Skip to content

wenceslasdk/data-science-2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

196 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science 2 – NMFP436

Lectures: Karel Kozmík, Ondřej Týbl

Practicals: Karel Kozmík, Ondřej Týbl

Contact:

This repository contains materials to the Data Science 2 - NMFP436 course.

Lectures plan

Date Topic Lecturer
17.2.2026 Intro Karel
24.2.2026 Decision Trees I Vaclav
24.2.2026 Decision Trees II Vaclav
10.3.2026 Decision Trees III Karel
17.3.2026 Decision Trees IV Karel
24.3.2026 Neural Networks I Ondřej
31.3.2026 Neural Networks II Ondřej
7.4.2026 Neural Networks III Ondřej
14.4.2026 Neural Networks IV Ondřej
21.4.2026 Neural Networks V Ondřej
28.4.2026 Neural Networks VI Ondřej
5.5.2026 Neural Networks VII Ondřej
12.5.2026 TBA TBA
19.5.2026 Exam TBA

Practicals plan

Date Topic Lecturer
17.2.2026 Intro + Environment setup Karel
3.3.2026 Data Science Basics I Ondřej
3.3.2026 Data Science Basics II Ondřej
10.3.2026 Decision Trees I Karel
17.3.2026 Decision Trees II Karel
24.3.2026 Decision Trees III Karel
31.3.2026 Hyperparameters Optimization Karel
7.4.2026 Neural Networks I Ondřej
14.4.2026 Neural Networks II Ondřej
21.4.2026 Neural Networks III Ondřej
28.4.2026 Neural Networks IV Ondřej
5.5.2026 Neural Networks V Ondřej
12.5.2026 Neural Networks VI Ondřej
19.5.2026 Exam TBA

To receive the course credit, students must successfully work out two home assignments, one will be focused on decision trees and the other one on neural networks. There are only two assignments, but they will be complex and require considerable amount of work. Details will be published later in the semester. The course credit is a necessary requirement to take the final exam.

We use git to store all course files

How to set-up your python environment

The following instructions will guide you to set-up everything needed to run the course code. Long story short, we use virtual environment managed by poetry running on python3.10 to install all dependencies from the lock file provided.

1) Install PyCharm Professional and git

Note: it is not mandatory to use the Professional version, Community edition is also fine. The Professional version has some nice features like integrated Jupyter or automatic Python download.

2) Set up GitHub

3) Get the course files - several options ordered by level of recommendation

3.1) Use PyCharm integrated version control

  • PyCharm has an integration to GitHub
  • Click File -> Project from Version Control (or in the startup window you will see something like "Clone repository" or "VCS")
  • Clone your repository by selecting it in the version control and create a project from this folder
  • Do not clone the repository into a cloud folder! Use 'users//repos/data-science-2' foe example
  • This will create a project without any Interpreter
  • In the bottom right, click on the , select "Add New Interpreter" -> "Add Local Interpreter"
  • Create a new virtual environment with Python 3.10 (if not present, it will be downloaded)

3.2) Manually

  • Using the command line, clone your forked repository
cd C:\Users\tyblondr
git clone https://github.com/wenceslasdk/data-science-2.git
  • in my example the course repository would be C:\Users\tyblondr\data-science-2

  • now to create the Project

  • open PyCharm and click 'New Project'

  • set 'Name' to 'data-science-2' and 'Location' to the parent directory of your cloned repository from 2), i.e. in the example above we would set 'Location' to '\Users\tyblondr'

  • choose 'Python version' as 'Python 3.10' (will be downloaded and installed if it does not exist yet on your computer)

  • click 'Create' and choose 'Create from Existing Resources' if you are asked

3.3) Download the repository .zip from GitHub

  • if everything fails, you can always just download the files
  • click on the button <> Code, then Download ZIP

4) Install dependencies

  • open Terminal in PyCharm (one of the icons in the left-down corner)
  • make sure there is (.venv) at the beginning of your command line, denoting you are now in the activated virtual environment, that you created in the previous steps
  • install poetry by typing
pip install poetry
  • install remaining packages using poetry (we use --no-root to indicate that the environment itself already exists)
poetry install --no-root

Note for Conda Users: If you are using conda on your system, it will probably happen that your PyCharm terminal will be always opened with both base and the project virtual env being activated. This should not be a problem but we advice to double-check that 'which python' points to the python within the project and not the one in your conda installation.

5) Run Jupyter

  • You can run Jupyter directly inside PyCharm
  • Alternatively you can run Jupyter Lab by typing this into the terminal:
jupyter lab

Keeping the fork updated

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors