Pre-processing data often simplifies ML training. This page highlights Rust libraries that aid in pre-processing data through cleaning, normalization, and transformation, as well as feature extraction and selection.
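As a rough illustration of what a normalization step looks like, here is a minimal sketch of min-max scaling using only the Rust standard library. The function name `min_max_normalize` is hypothetical and not taken from any crate listed on this page. The sketch assumes the input contains no NaN values.

```rust
/// Rescale values linearly into the range [0, 1] (min-max normalization).
/// Hypothetical helper for illustration; not part of any listed crate.
/// Assumes the input contains no NaN values.
fn min_max_normalize(values: &[f64]) -> Vec<f64> {
    // Find the smallest and largest value in a single pass each.
    let min = values.iter().copied().fold(f64::INFINITY, f64::min);
    let max = values.iter().copied().fold(f64::NEG_INFINITY, f64::max);
    let range = max - min;

    // If every value is identical, map everything to 0.0 to avoid dividing by zero.
    if range == 0.0 {
        return vec![0.0; values.len()];
    }

    values.iter().map(|v| (v - min) / range).collect()
}

fn main() {
    let raw = [2.0, 4.0, 6.0, 10.0];
    let scaled = min_max_normalize(&raw);
    println!("{:?}", scaled); // prints [0.0, 0.25, 0.5, 1.0]
}
```

In practice, a DataFrame or array library from the list below would likely handle this kind of column-wise operation for you. The sketch only shows the underlying arithmetic.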
Data Processing
csv [ crate · repo · docs ]
Fast CSV parsing with support for serde.
Last Commit: 2024-11-09
Last Published: 2024-11-09
arrow [ crate · repo · docs ]
Rust implementation of Apache Arrow
Last Commit: 2024-11-24
Last Published: 2024-11-20
datafusion [ crate · repo · docs ]
DataFusion is an in-memory query engine that uses Apache Arrow as the memory model
Last Commit: 2024-11-24
Last Published: 2024-11-08
polars [ crate · repo · docs ]
DataFrame library based on Apache Arrow
Last Commit: 2024-11-24
Last Published: 2024-11-01
tokenizers [ crate · repo · docs ]
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility.
Last Commit: 2024-11-24
Last Published: 2024-11-05
hdf5 [ crate · repo · docs ]
Thread-safe Rust bindings for the HDF5 library.
Last Commit: 2024-08-08
Last Published: 2021-11-21
timely [ crate · repo · docs ]
A low-latency data-parallel dataflow system in Rust
Last Commit: 2024-11-24
Last Published: 2024-11-12
ai-dataloader [ crate · repo · docs ]
Rust implementation of the PyTorch DataLoader
Last Commit: 2024-11-11
Last Published: 2024-09-14
cuticula [ crate · repo · docs ]
Data Preprocessing library for Machine Learning
Last Commit: 2017-07-07
Last Published: 2016-04-18
rettle [ crate · repo · docs ]
A multithreaded ETL with inspiration drawn from Keras.
Last Commit: 2020-07-04
Last Published: 2020-01-08
arff [ crate · repo · docs ]
ARFF file format serializer and deserializer
Last Commit: 2019-07-26
Last Published: 2018-06-20
rusty-data [ crate · repo · docs ]
A data handling library (designed for machine learning).
Last Commit: 2016-01-27
Last Published: 2016-01-27
openml [ crate · repo · docs ]
A Rust interface to OpenML (http://openml.org/).
Last Commit: 2019-07-13
Last Published: 2018-06-27
Do you know of a crate that is missing? Create an issue or a PR.
Looking for something you didn't find? Try asking on Zulip chat.