Releases: datachain-ai/datachain
Releases · datachain-ai/datachain
0.51.1
What's Changed
- Skills by @dmpetrov in #1650
- Skill readme by @dmpetrov in #1695
- Making
enlist_sourceusing ephemeral chain and add check for job to pre-exist if in studio by @ilongin in #1718 - Remove legacy compatibility code by @shcheklein in #1723
Full Changelog: 0.51.0...0.51.1
0.51.0
What's Changed
- build(deps-dev): bump mypy from 1.19.1 to 1.20.0 by @dependabot[bot] in #1714
- perf(metastore): eliminate O(N) merge_versions calls in list_datasets by @amritghimire in #1721
- feat!: remove deprecated APIs by @amritghimire in #1720
Breaking Changes
The deprecated functions and classes are removed.
DataChain class methods removed (use module-level functions)
| Removed | Replacement |
|---|---|
DataChain.from_storage() |
read_storage() |
DataChain.from_dataset() |
read_dataset() |
DataChain.from_json() |
read_json() |
DataChain.from_values() |
read_values() |
DataChain.from_pandas() |
read_pandas() |
DataChain.from_hf() |
read_hf() |
DataChain.from_csv() |
read_csv() |
DataChain.from_parquet() |
read_parquet() |
DataChain.from_records() |
read_records() |
DataChain.datasets() |
module-level datasets() |
DataChain.listings() |
module-level listings() |
DataChain instance methods removed
| Removed | Replacement |
|---|---|
DataChain.batch_map() |
DataChain.agg() |
DataChain.collect() |
DataChain.to_iter() |
UDF classes removed
| Removed | Replacement |
|---|---|
BatchMapper |
Aggregator |
File methods removed
| Removed | Replacement |
|---|---|
File.get_uri() |
file.get_fs_path() |
resolve(file) function |
file.resolve() |
Environment variables removed
| Removed | Replacement |
|---|---|
DVC_STUDIO_* |
DATACHAIN_STUDIO_* |
Full Changelog: 0.50.2...0.51.0
0.50.2
0.50.1
What's Changed
- UDF checkpoints for aggregator by @ilongin in #1593
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in #1711
- refactor(catalog): add remove_dataset_versions by @amritghimire in #1704
Full Changelog: 0.50.0...0.50.1
0.50.0
What's Changed
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in #1694
- Removing exact hash assertion to hash tests by @ilongin in #1697
- Using listing dataset
uuidfor hashing instead of listing URI by @ilongin in #1656 - fix: Type inference in column expression by @dmpetrov in #1693
- fix(group_by): support inline func expressions in partition_by without label by @dmpetrov in #1677
- fix(catalog): use for loop in remove_dataset to handle skipped versions by @amritghimire in #1703
Full Changelog: 0.49.1...0.50.0
0.49.1
0.49.0
What's Changed
- get-dataset metastore: exclude preview and versions by default by @shcheklein in #1661
Full Changelog: 0.48.4...0.49.0
0.48.4
What's Changed
- remove torch pin since torchcodec 0.11 got released by @shcheklein in #1685
- Fix yolo tests by @ilongin in #1686
- Refactor hash tests to not use exact hash values by @ilongin in #1684
- Reverse order remove dataset version by @ilongin in #1675
Full Changelog: 0.48.3...0.48.4
0.48.3
What's Changed
- fix(dataset_info): handle None attrs in DatasetInfo validator by @amritghimire in #1680
Full Changelog: 0.48.2...0.48.3
0.48.2
What's Changed
- Expose query_script via Python API and CLI by @dmpetrov in #1672
- fix(cli): use new output format in
datachain showcommand by @dmpetrov in #1668 - feat: support version in dataset name for read_dataset by @dmpetrov in #1670
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in #1673
- Cleanup temp datasets by @ilongin in #1631
Full Changelog: 0.48.1...0.48.2