This repository contains an end-to-end data science project that demonstrates the complete data science workflow, from data collection to deployment. The project aims to provide insights into a real-world problem and solve it using data science techniques.

The project focuses on forecasting sales quantity for a supermarket. The data used in this project was collected from [data source]. It has been cleaned, preprocessed, and analyzed to extract insights and build a predictive model. The model was built with Facebook's Prophet algorithm and deployed using Jenkins and Looker Studio (formerly Google Data Studio).
| Library | Description |
|---|---|
| mysql-connector-python | A library that provides connectivity to MySQL databases. |
| matplotlib | A plotting library for creating visualizations in Python. |
| pandas | A library for data manipulation and analysis. |
| python-dotenv | A library for working with environment variables stored in a .env file. |
| pathlib | A library for working with file system paths. |
| argparse | A library for parsing command-line arguments. |
| os | A library for interacting with the operating system. |
| yaml | A library for parsing YAML files. |
| typing | A library for supporting type hints in Python. |
| gspread | A library for working with Google Sheets. |
| oauth2client.service_account | A library for authenticating with Google APIs using a service account. |
| prophet | A library for time series forecasting developed by Facebook. |
| io | A library for working with I/O streams. |
| importlib | A library for programmatically importing modules. |
| boto3 | A library for interacting with AWS services using Python. |
| googleapiclient.discovery | A library for discovering and using Google APIs. |
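Of these, `pathlib`, `argparse`, `os`, `typing`, `io`, and `importlib` ship with Python's standard library and need no installation; the rest are third-party. If you are not installing from a requirements file, a plausible one-line install is sketched below (note that the PyPI package names for `yaml` and `googleapiclient` are `PyYAML` and `google-api-python-client`):

```bash
pip install mysql-connector-python matplotlib pandas python-dotenv gspread oauth2client prophet boto3 google-api-python-client PyYAML
```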
To get started with this project, you'll need to clone the repository and make changes to the code. Here's how:
- Go to the repository's page on GitHub and click the "Code" button.
- Select "HTTPS" or "SSH" as the clone method, depending on your preference.
- Copy the URL provided.
- Open a terminal or command prompt and navigate to the directory where you want to store the repository on your local machine.
- Type "git clone" followed by the URL you copied in step 3.
For example, if the repository's URL is "https://github.com/user/repo.git" and you want to store the repository in a folder called "my-project" on your desktop, you would run:

```bash
git clone https://github.com/user/repo.git ~/Desktop/my-project
```
If you make changes to the code, it's important to give credit to the original project's author. You can do this by adding a note or comment to your code, or by including the original author's name and a link to the project in your documentation.
For example, if you add a new feature to the code, you could include a comment like this:

```python
# New feature added by [your name]. Original code by [original author name].
# Link to original project: [link to original project]
```
Once you have cloned the repository, you can start working with the code. Here are some tips to get you started:
- Read the User Guide and code comments to understand how the code works.
- Make changes to the code as needed, testing your changes to ensure they work correctly.
- If you want to contribute your changes back to the original project, create a pull request on GitHub. Be sure to include a detailed description of your changes and why they are important or necessary.
cd " YOUR FOLDER LOCATION "
This step is necessary to ensure that you are in the correct directory where the project files are located.
```bash
conda activate End2End
```

This activates the Conda environment that contains the required libraries and dependencies for the project.
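If the `End2End` environment does not yet exist on your machine, you can create it first. A plausible command is shown below; the Python version is an assumption, and the repository's own environment file (if it provides one) should take precedence:

```bash
conda create -n End2End python=3.10
conda activate End2End
```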
```bash
python src\tools\database_final.py -cd True -nd "YourDatabaseName"
```

This step creates the database with the specified name. The `-cd` argument specifies whether to create or drop the database, and `-nd` specifies the database name.
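For orientation, here is a minimal sketch of what a script like `database_final.py` might do internally. The flag names match the commands above, but the connection handling and structure are assumptions, not the repository's actual implementation:

```python
import argparse
import os

import mysql.connector
from dotenv import load_dotenv


def main():
    parser = argparse.ArgumentParser(description="Create or drop a MySQL database.")
    parser.add_argument("-cd", dest="create_db", default="False",
                        help="Whether to create the database.")
    parser.add_argument("-nd", dest="db_name", required=True,
                        help="Name of the database.")
    args = parser.parse_args()

    load_dotenv()  # read MySQL credentials from a .env file
    conn = mysql.connector.connect(
        host=os.getenv("MYSQL_HOST", "localhost"),
        user=os.getenv("MYSQL_USER"),
        password=os.getenv("MYSQL_PASSWORD"),
    )
    cursor = conn.cursor()
    if args.create_db == "True":
        cursor.execute(f"CREATE DATABASE IF NOT EXISTS `{args.db_name}`")
    conn.close()


if __name__ == "__main__":
    main()
```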
```bash
python src\tools\database_final.py -nd "YourDatabaseName" -id upload-to-database
```

This step loads the raw data into the database. The `-nd` argument specifies the database name, and `-id` specifies the operation to perform (in this case, uploading data to the database).
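The upload step presumably reads the raw files with pandas and inserts the rows into MySQL. Here is a hedged sketch of that idea; the file path, table name, and schema are placeholders, not the project's actual layout:

```python
import os

import mysql.connector
import pandas as pd
from dotenv import load_dotenv

load_dotenv()
df = pd.read_csv("data/raw/sales.csv")  # placeholder path

conn = mysql.connector.connect(
    host=os.getenv("MYSQL_HOST", "localhost"),
    user=os.getenv("MYSQL_USER"),
    password=os.getenv("MYSQL_PASSWORD"),
    database="YourDatabaseName",
)
cursor = conn.cursor()

# Build a parameterized INSERT and batch-insert all rows.
placeholders = ", ".join(["%s"] * len(df.columns))
columns = ", ".join(df.columns)
sql = f"INSERT INTO raw_sales ({columns}) VALUES ({placeholders})"
cursor.executemany(sql, list(df.itertuples(index=False, name=None)))
conn.commit()
conn.close()
```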
```bash
python src\tools\etl_script_final.py
```

This step performs the ETL (Extract, Transform, Load) process to transform the raw data into a format suitable for modeling.
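As a rough illustration of the kind of transformation such an ETL step might perform (the column names and aggregation level are assumptions, since the actual schema isn't described here):

```python
import pandas as pd


def transform(raw: pd.DataFrame) -> pd.DataFrame:
    """Clean raw sales rows into a daily quantity series for forecasting."""
    df = raw.copy()
    df["date"] = pd.to_datetime(df["date"], errors="coerce")
    df = df.dropna(subset=["date", "quantity"])
    df = df[df["quantity"] > 0]  # drop returns and bad rows
    # Aggregate to one row per day, the granularity Prophet expects.
    daily = df.groupby(df["date"].dt.date, as_index=False)["quantity"].sum()
    daily.columns = ["ds", "y"]  # Prophet's required column names
    return daily
```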
python src\tools\database_final.py -cd True -nd "CleanedDatabaseName"
python src\tools\database_final.py -nd "CleanedDatabaseName" -id cleaned-upload-to-database
```bash
python main.py -t sql_python
```

This executes the `main.py` script with the `sql_python` task parameter, triggering the SQL query and export process. The resulting output file (`df`) is uploaded to the specified S3 bucket after the script completes.
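A minimal sketch of the S3 upload portion, assuming boto3 with credentials supplied via the environment, and placeholder file, bucket, and key names:

```python
import boto3

# Credentials are picked up from the environment or ~/.aws/credentials.
s3 = boto3.client("s3")

s3.upload_file(
    Filename="output/df.csv",   # placeholder local file
    Bucket="your-s3-bucket",    # placeholder bucket name
    Key="exports/df.csv",       # placeholder object key
)
```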
```bash
python main.py -t modeling_final
```

This step runs the final modeling script, which builds a predictive model on the cleaned data and uploads the predictions to a Google Sheet. The `-t` argument specifies the task to run; here, `modeling_final`.
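To show the shape of that modeling step, here is a hedged sketch combining Prophet and gspread. The credentials file, sheet name, input path, and 30-day forecast horizon are placeholders; the real script may differ:

```python
import gspread
import pandas as pd
from oauth2client.service_account import ServiceAccountCredentials
from prophet import Prophet

# Daily quantity series with Prophet's expected columns: ds (date) and y (value).
daily = pd.read_csv("data/cleaned/daily_quantity.csv")  # placeholder path

model = Prophet()
model.fit(daily)

future = model.make_future_dataframe(periods=30)  # forecast 30 days ahead
forecast = model.predict(future)
predictions = forecast[["ds", "yhat", "yhat_lower", "yhat_upper"]].tail(30).copy()
predictions["ds"] = predictions["ds"].dt.strftime("%Y-%m-%d")

# Push the predictions to a Google Sheet via a service account.
scope = ["https://www.googleapis.com/auth/spreadsheets",
         "https://www.googleapis.com/auth/drive"]
creds = ServiceAccountCredentials.from_json_keyfile_name("service_account.json", scope)
sheet = gspread.authorize(creds).open("Supermarket Forecast").sheet1  # placeholder sheet
sheet.update([predictions.columns.tolist()] + predictions.values.tolist())
```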
If you would like to contribute to this project, please open a pull request from a separate branch so that the main branch stays stable.
This project showcases the ability to handle real-world data and solve a problem using data science techniques. It can serve as a reference for building similar projects in the future. Feel free to use the code and modify it as needed, and don't forget to give credit.
👨‍💻 Pradeepchandra Reddy S C
👨‍💻 Data To Production (Mitul Patel MSc - Mentor)
