Check it out here.
Data Merger is a web application that allows users to merge multi level datasets using a common unique identifier. The app supports uploading Excel and CSV files and provides the merged dataset as a downloadable CSV or Excel file with three sheets: outer, inner, and merged data.
- Upload multiple Excel (
.xlsx
) or CSV (.csv
) files. - Select the inner and outer datasets from the uploaded files.
- Choose the unique identifier columns for both datasets from a dropdown list.
- Check whether the unique identifier columns have the same data type.
- Display the number of matches made based on the unique identifier.
- Download the merged dataset as a CSV or Excel file.
-
Clone this repository or download the source code.
-
Navigate to the project directory and create a virtual environment:
python -m venv venv
-
Activate the virtual environment:
-
For Windows:
venv\Scripts\activate
-
For macOS and Linux:
source venv/bin/activate
-
-
Install the required packages:
pip install -r requirements.txt
-
Run the Streamlit app:
streamlit run app.py
-
Open the displayed URL in your web browser to access the app.
- Upload Excel or CSV files containing the datasets you want to merge.
- Select the inner and outer datasets from the uploaded files.
- Choose the unique identifier columns for both datasets.
- View the merged dataset and the number of matches made based on the unique identifier.
- Download the merged dataset as a CSV or Excel file.
- streamlit
- pandas
- openpyxl
- xlsxwriter
- streamlit-lottie
- requests