CodeSolid/large-datasets

Python Large Datasets

This is the companion source for the article Large Data Sets in Python: Pandas and the Alternatives.

Installing the tools:

This project was originally set up with the commands below. For the exact package versions, see requirements.txt and environment.yml:

conda create -n python3.11 python=3.11
conda activate python3.11
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install jupyterlab ipython pandas dask polars fastparquet
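
After running the commands above, it can be useful to confirm that the packages actually landed in the active environment. The following is a minimal sketch, not part of the original project: the helper name installed_versions and the PACKAGES list are made up for illustration, and the list simply mirrors the pip install line above. It uses only the standard library's importlib.metadata.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names copied from the pip install line above (illustrative list).
PACKAGES = ["jupyterlab", "ipython", "pandas", "dask", "polars", "fastparquet"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in the current environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(PACKAGES).items():
        print(f"{name:12} {ver or 'NOT INSTALLED'}")
```

Run it with the .venv environment activated. Any package reported as NOT INSTALLED probably needs the pip install step repeated, and the printed versions can be compared against requirements.txt.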

The main thing to see here is the Pandas.ipynb notebook, which contains much of the source for the articles.

The polars_rust folder is experimental and may be the subject of a future article on rustassured.com.
