Metadata-Version: 2.1
Name: data-source-faker
Version: 0.0.2
Summary: 
Author: dfunto
Author-email: dadutra2@gmail.com
Requires-Python: >=3.10,<4.0
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Requires-Dist: boto3 (>=1.35.81,<2.0.0)
Requires-Dist: faker (>=33.1.0,<34.0.0)
Requires-Dist: fastparquet (>=2024.11.0,<2025.0.0)
Requires-Dist: pandas (>=2.2.3,<3.0.0)
Requires-Dist: pyarrow (>=18.1.0,<19.0.0)
Requires-Dist: pyyaml (>=6.0.2,<7.0.0)
Requires-Dist: schedule (>=1.2.2,<2.0.0)
Description-Content-Type: text/markdown

# Data Source Faker
Generates fake source data for testing data ingestion pipeline


The output folder can be set to remote cloud storages as well, given that the necessary credentials are setup. 
Pandas takes care of the file writing, follow [their docs](https://pandas.pydata.org/docs/user_guide/io.html#reading-writing-remote-files) on how to authenticate and setup the `data_output` parameter to the remote path (e.g. s3://...)


## Run Unit Tests



