General-purpose scripts and Python utilities for exploring CSV files, Excel files, and pandas DataFrames. Includes descriptive statistics, versatile hypothesis testing, and correlation analysis.
Clone the repository
git clone https://github.com/wajsbrot/datascribe.git
Install the package from the datascribe folder
pip install -e .
DataScribe provides both functions for use as a package and scripts for creating plots or summaries from CSV files.
Package usage in Python code
from datascribe.stats import compare_common_columns
df = compare_common_columns(df_a, df_b)
Scripts usage
audit.py my_file.csv
To use the scripts, add the scripts folder to the PATH environment variable and make the scripts executable with
chmod +x /path_to_datascribe/scripts/*
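The PATH step can be sketched as follows for a POSIX-style shell such as bash. This is a minimal sketch: `/path_to_datascribe` is the same placeholder used above and must be replaced with the actual clone location, and the variable name `DATASCRIBE_SCRIPTS` is only illustrative.

```shell
# Assumption: the repository lives at /path_to_datascribe (placeholder, adjust to your clone).
# DATASCRIBE_SCRIPTS is a hypothetical variable name used only for this example.
DATASCRIBE_SCRIPTS="/path_to_datascribe/scripts"

# Append the scripts folder to PATH for the current shell session.
export PATH="$PATH:$DATASCRIBE_SCRIPTS"
```

This change lasts only for the current session. To keep it across sessions, the same `export` line is typically added to a shell startup file such as `~/.bashrc` or `~/.profile`, depending on the shell in use.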