@nagapavithrampl

This pull request contains my completed Week 6 deliverables for the Data Glacier Virtual Internship in Data Analytics. The submission includes:

  • ✅ Cleaned and sampled dataset (yellow_tripdata_2023_pipe.gz), kept under 50 MB
  • ✅ YAML schema file describing column types and formats
  • ✅ Jupyter Notebook with data validation, sampling, and export logic
  • ✅ Final PDF report with screenshots and documented steps

All files are located in the Week6/ folder. Please review and let me know if any updates are needed.
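
For reviewers who want a quick picture of the sampling and export step, here is a minimal sketch using only the Python standard library. It is not the notebook's actual code: the function names (`sample_rows`, `write_pipe_gz`, `read_pipe_gz`), the fixed seed, and reading "under 50 MB" as 50 × 1024 × 1024 bytes are all assumptions made for illustration.

```python
import csv
import gzip
import os
import random
import tempfile

# Assumed interpretation of the "under 50 MB" size limit.
MAX_BYTES = 50 * 1024 * 1024


def sample_rows(rows, fraction, seed=42):
    """Keep each row with probability `fraction` (reproducible via `seed`)."""
    rng = random.Random(seed)
    return [row for row in rows if rng.random() < fraction]


def write_pipe_gz(path, header, rows):
    """Write rows as pipe-delimited text, gzip-compressed; return file size in bytes."""
    with gzip.open(path, "wt", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh, delimiter="|")
        writer.writerow(header)
        writer.writerows(rows)
    return os.path.getsize(path)


def read_pipe_gz(path):
    """Read the file back so the export can be checked against what was written."""
    with gzip.open(path, "rt", newline="", encoding="utf-8") as fh:
        reader = csv.reader(fh, delimiter="|")
        header = next(reader)
        return header, list(reader)
```

A round trip on synthetic rows (hypothetical column names) would write a sample with `write_pipe_gz`, confirm the returned size is below `MAX_BYTES`, and compare the output of `read_pipe_gz` with the rows that were written.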

Thank you!
