Prepend prefix to to_parquet created files #622
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository
Issue #, if available:
#617
Description of changes:
In the `to_parquet` method, when `dataset=True`, the created parquet file names are randomly generated from a UUID. This PR prepends a prefix to these file names, giving more control over the output parquet file names.

An optional `filename_prefix` argument is added to the method, set to `None` by default. If it is supplied and `dataset` is `True`, the prefix is prepended to the output parquet file names.

Tested through `test_to_parquet_filename_prefix`.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
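
For reviewers, the naming behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `build_parquet_filename` is a hypothetical helper, and the exact shape of the random part (a `uuid4` hex string) and the `.parquet` suffix are assumptions.

```python
import uuid
from typing import Optional


def build_parquet_filename(filename_prefix: Optional[str] = None, dataset: bool = True) -> str:
    """Hypothetical helper: build an output file name, optionally prefixed.

    Assumption: the random part is a uuid4 hex string; the library's
    actual naming scheme may differ.
    """
    random_part = uuid.uuid4().hex
    # The prefix only applies when writing a dataset and a prefix was supplied.
    prefix = filename_prefix if (dataset and filename_prefix) else ""
    return f"{prefix}{random_part}.parquet"


if __name__ == "__main__":
    print(build_parquet_filename())              # random name, no prefix
    print(build_parquet_filename("my_prefix_"))  # same shape, with the prefix first
```

With `filename_prefix=None` the name is unchanged from the current behaviour, so existing callers would see no difference.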