126 shaares
1 private link
1 private link
3 results
tagged
data
This project makes it easy to analyze the Python ecosystem by providing of all the code ever published to PyPI via git, parquet datasets with file metadata, and a set of tools to help analyze the data.
Thanks to the power of git the contents of PyPI takes up only 439.4 GB on disk, and thanks to tools like libcst every Python file can be analysed on a consumer-grade laptop in a few hours.
Download the 2024 Environmental Report. This report charts our progress and methodology, and shares knowledge and insights for others.