Releases: Eventual-Inc/Daft

v0.1.18

26 Sep 01:17
3403c0c

Changes

✨ New Features

👾 Bug Fixes

📖 Documentation

  • [BUG] [Docs] Allow source code discovery to fail silently for pyo3-defined classes when generating docs. @clarkzinzow (#1430)
  • [FEAT] Implement .dt.year/month/day for timestamp types @jaychia (#1385)

🧰 Maintenance

v0.1.17

12 Sep 06:39
601260b

Changes

✨ New Features

🚀 Performance Improvements

👾 Bug Fixes

  • [BUG] Respect multithreaded_io flag when reading parquet @samster25 (#1359)
  • [BUG] Schema Display should use dtype Display instead of Debug @jaychia (#1355)
  • [BUG] propagate parquet io error instead of panicking @samster25 (#1352)

🧰 Maintenance

  • [CHORE] [New Query Planner] Add simple df.explain() option; change to fixed-point policy for rule batch @clarkzinzow (#1354)
  • [CHORE] Add status code to IO integration tests @jaychia (#1356)
  • [CHORE] Fix List/FixedSizeList DataType to hold a dtype instead of Field @jaychia (#1351)
  • [CHORE] Add Series::full_null/empty/from_arrow to reduce code duplication @jaychia (#1331)
  • [CHORE] Add a Growable factory method @jaychia (#1330)
  • [CHORE] Add new ListArray @jaychia (#1329)

⬆️ Dependencies

5 changes

v0.1.16

06 Sep 02:07
bdc4ba4

Changes

✨ New Features

👾 Bug Fixes

  • [BUG] Fix Table.read_parquet behavior when it encounters arrow_schema @jaychia (#1336)
  • [BUG] [New Query Planner] Revert file info partition column names. @clarkzinzow (#1333)
  • [BUG] Fix fixed size list array FullNull implementation @jaychia (#1320)

🧰 Maintenance

  • [CHORE] install perl before maturin @samster25 (#1345)
  • [CHORE] Switch to openssl @samster25 (#1344)
  • [CHORE] [New Query Planner] pyo3-agnostic LogicalPlanBuilder, op constructor arg orderings @clarkzinzow (#1332)
  • [CHORE] factor io config into common code @samster25 (#1335)
  • [CHORE] [New Query Planner] Remove ExpressionsProjection from builder, move validation into Op::try_new() @clarkzinzow (#1327)
  • [CHORE] StructArray refactors @jaychia (#1326)
  • [CHORE] drop flag for non native compile for daft profiling @samster25 (#1323)
  • [CHORE] pin pyarrow to 12 for ray compat tests @samster25 (#1322)
  • [CHORE] Move FixedSizeListArray to array/fixed_size_list_array.rs @jaychia (#1319)
  • [CHORE] Add fix for list schema inference tests using PyArrow 13.0.0 @jaychia (#1318)
  • [CHORE] Implementations of FixedSizeListArray @jaychia (#1281)

⬆️ Dependencies

v0.1.15

28 Aug 06:43
59ed92a

Changes

✨ New Features

  • [FEAT] add row group support to daft parquet reader @samster25 (#1308)
  • [FEAT] [New Query Planner] Add logical plan hashing, rule batches, fixed-point policies, early optimizer termination, and optimization cycle detection. @clarkzinzow (#1292)
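
For readers unfamiliar with the optimizer terms in #1292, the following is a minimal Python sketch of how rule batches, a fixed-point policy, early termination, and cycle detection can fit together. It is not Daft's implementation (which lives in the Rust query planner); `RuleBatch`, `rewrite`, `optimize`, the tuple-based plan encoding, and the two example rules are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical plan encoding: (operator_name, child_plan_or_None).
# Tuples are hashable, which stands in for logical plan hashing here.
Plan = Optional[tuple]
Rule = Callable[[tuple], Optional[tuple]]  # returns a replacement node, or None


@dataclass
class RuleBatch:
    rules: List[Rule]
    max_passes: int = 1  # 1 = run once; >1 = fixed-point policy with a pass cap


def rewrite(plan: Plan, rule: Rule) -> Plan:
    """Apply `rule` to every node, children first, and return the new plan."""
    if plan is None:
        return None
    name, child = plan
    node = (name, rewrite(child, rule))
    replaced = rule(node)
    return node if replaced is None else replaced


def optimize(plan: Plan, batches: List[RuleBatch]) -> Plan:
    seen = {plan}  # every plan produced so far, used for cycle detection
    for batch in batches:
        for _ in range(batch.max_passes):
            before = plan
            for rule in batch.rules:
                plan = rewrite(plan, rule)
            if plan == before:
                break  # fixed point reached: terminate this batch early
            if plan in seen:
                break  # the rules led back to an earlier plan: a cycle
            seen.add(plan)
    return plan


def collapse_distinct(node: tuple) -> Optional[tuple]:
    """distinct(distinct(x)) is equivalent to distinct(x)."""
    name, child = node
    if name == "distinct" and child is not None and child[0] == "distinct":
        return child
    return None


def drop_noop(node: tuple) -> Optional[tuple]:
    """A no-op node can be replaced by its child."""
    name, child = node
    return child if name == "noop" else None


plan = ("distinct", ("distinct", ("noop", ("distinct", ("scan", None)))))
once = optimize(plan, [RuleBatch([collapse_distinct, drop_noop], max_passes=1)])
fixed = optimize(plan, [RuleBatch([collapse_distinct, drop_noop], max_passes=10)])
```

In this sketch a single pass leaves a redundant `distinct` behind, because removing the no-op only exposes the second collapse opportunity after `collapse_distinct` has already run. The fixed-point batch keeps applying its rules until the plan stops changing, so it reaches the fully simplified plan and then stops well before the pass cap.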

👾 Bug Fixes

🧰 Maintenance

  • [CHORE] Refactor Growable traits and downcast for lifetimes @jaychia (#1305)
  • [CHORE] Refactor broadcast to use growables @jaychia (#1304)
  • [CHORE] Code reduction in growable macros + logical if/else refactor @jaychia (#1301)
  • [CHORE] Refactor growables to return a Series instead of concrete arrays @jaychia (#1297)
  • [CHORE] Minor cleanup for logical_plan::Project @xcharleslin (#1299)

v0.1.14

24 Aug 23:35
7fa9e64

Changes

✨ New Features

  • [FEAT] add flag to use multithreaded io for parquet_read_table @samster25 (#1298)
  • [FEAT] Add Retry Mode, connection timeout, and read timeout to S3Config @samster25 (#1293)
  • [FEAT] [New Query Planner] Add optimization framework and PushDownFilter rule. @clarkzinzow (#1284)

👾 Bug Fixes

🧰 Maintenance

v0.1.13

18 Aug 22:25
857162c

Changes

✨ New Features

  • [FEAT] Add Flag to_arrow to convert large string arrays @samster25 (#1283)

👾 Bug Fixes

  • [BUG] try release profile rather than dev-bench for daft profiling @samster25 (#1280)

🧰 Maintenance

  • [CHORE] reduce severity of region reroute logs to debug @samster25 (#1279)

v0.1.12

16 Aug 17:01
2b82afc

Changes

✨ New Features

👾 Bug Fixes

  • [BUG] Use manylinux_2_24 for aarch64 linux to be able to publish manylinux2014 @samster25 (#1275)

📖 Documentation

  • [FEAT] [New Query Planner] Support for Ray runner in new query planner. @clarkzinzow (#1265)

🧰 Maintenance

  • [CHORE] Refactor arrays to share a FromArrow constructor trait @jaychia (#1276)
  • [CHORE] Bump rust nightly channel date @jaychia (#1255)

⬆️ Dependencies

4 changes

v0.1.11

11 Aug 19:36
48b46b3

Changes

✨ New Features

  • [FEAT] [New Query Plan] Add support for Projection and Coalesce, enable many tests @clarkzinzow (#1256)
  • [FEAT] [New Query Planner] Add support for Concat. @clarkzinzow (#1254)
  • [FEAT] [New Query Planner] Add support for tabular writes. @clarkzinzow (#1252)
  • [FEAT] Multi-partition aggregate; Coalesce @xcharleslin (#1249)
  • [FEAT] [New Query Planner] Add support for Sort, Repartition, and Distinct in new query planner. @clarkzinzow (#1248)
  • [FEAT] Add Azure Support for Native Downloader @samster25 (#1250)
  • [FEAT] Locally unique semantic IDs for Expressions @xcharleslin (#1243)
  • [FEAT] Read parquet tables with int96 coercion option @jaychia (#1231)
  • [FEAT] [New Query Plan] Add support for CSV scans, JSON scans, in-memory scans and caching materialized results. @clarkzinzow (#1246)
  • [FEAT] Native Downloader add Retry Config parameters @samster25 (#1244)
  • [FEAT] (Single partition only) DataFrame.sum() via Rust planner @xcharleslin (#1230)
  • [FEAT] [New Query Planner] Logical --> physical translation, physical plan execution. @clarkzinzow (#1232)
  • [FEAT] native parquet correctness checks @samster25 (#1225)
  • [FEAT] add session token as input to io config @samster25 (#1224)

🚀 Performance Improvements

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

  • [CHORE] Update test to only use store_schema kwarg for pa>=11 @jaychia (#1253)
  • [CHORE] [New Query Planner] Introduce LogicalPlanBuilder and QueryPlanner interfaces to hide query planner implementations. @clarkzinzow (#1245)
  • [CHORE] LogicalPlan: Add display improvements, and Filter @xcharleslin (#1221)
  • [CHORE] Add unit tests for int96 timestamps @jaychia (#1229)
  • [DOCS][CHORE] Add docs for IOConfig and S3Config @jaychia (#1227)
  • [CHORE] disable mac test for lack of docker @samster25 (#1223)
  • [CHORE] Begin integrating Rust Logical Plan with Dataframe API @xcharleslin (#1207)
  • [CHORE] integration tests for nightly platform wheels @samster25 (#1219)
  • [CHORE] Remove existing LogicalPlan from all execution concepts @xcharleslin (#1208)
  • [CHORE] Add endpoints to simulate rate-limiting on AWS S3 buckets @jaychia (#1220)
  • [CHORE] Add pytest marker for integration @jaychia (#1211)
  • [CHORE] Add s3 fixtures for retrying logic @jaychia (#1206)
  • [CHORE] Add developer flag to use Rust query planner @xcharleslin (#1205)
  • [CHORE] Rust Logical plan skeleton @xcharleslin (#1192)

⬆️ Dependencies

7 changes

v0.1.10

31 Jul 18:36
bc11e57

Changes

✨ New Features

  • [FEAT] Enable feature-flagged native downloader in daft.read_parquet @jaychia (#1190)
  • [FEAT] parquet reader refactor, add parquet_stats_reader and parquet_schema_reader (1/2) @samster25 (#1191)

🚀 Performance Improvements

🧰 Maintenance

⬆️ Dependencies

6 changes

v0.1.9

25 Jul 04:16
751b839

Changes

🏆 Highlights

  • [FEAT] [Tensor] Add support for Tensor and FixedShapeTensor types. @clarkzinzow (#1073)

✨ New Features

🚀 Performance Improvements

  • [PERF] Simple Read Planner and RangeReader for Native Parquet Reader @samster25 (#1172)

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

⬆️ Dependencies

10 changes