In this repository, I provide apps and scripts for converting CSV files to Parquet using Python. See the modes below and use the one that is suitable for your application.
Watch the YouTube video demonstration of this case here: soon
Aumentando a performance convertendo arquivos CSV por Parquet no Power BI
An .exe application that does not require installation, just download, configure the parameters, and run.
An .EXE file that can be executed via command prompt and, best of all, can be configured in Windows scheduled tasks.
Configure in Actions the full path of the executable file, for example:
C:\Users\aliso\OneDrive\Documentos\GitHub\csv-to-parquet\exe\csv_to_parquet_cli.exe
You can use options as:
-i "C:\Users\aliso\OneDrive\Documentos\GitHub\csv-to-parquet\sample_dataset_csv" -o "C:\Users\aliso\OneDrive\Documentos\GitHub\csv-to-parquet\sample_dataset_parquet" -e ".csv" -enc "utf-8" -s ";" -hd 0
You can run on powershell or configure a .bat file to run manually too
cd C:\Users\aliso\OneDrive\Documentos\GitHub\csv-to-parquet\exe
csv_to_parquet_cli.exe -i "C:\Users\aliso\OneDrive\Documentos\GitHub\csv-to-parquet\sample_dataset_csv" -o "C:\Users\aliso\OneDrive\Documentos\GitHub\csv-to-parquet\sample_dataset_parquet" -e ".csv" -enc "utf-8" -s ";" -hd 0
Unlike the compiled versions above, this is a Python script that can be run in VSCode, Python interpreters, Notebooks, etc. You choose. Completely open, it gives you the freedom to create new implementations and other automations.
Version | Updated | Link | Release Note |
---|---|---|---|
v1.1.0 | 2025-01-12 20:05:00 | Csv to Parquet v1.1.0 | Included xlsx extension |
v1.0.0 | 2025-01-11 15:49:00 | Csv to Parquet v1.0.0 | Initial version |