Skip to content

drashti4/learnds

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Trade Data Analysis

A data science project for analyzing international trade data, including imports and exports across different countries, products, and modes of transport. This project demonstrates data cleaning, exploratory data analysis, and interactive visualization using Python.

📊 Project Overview

This project processes and analyzes trade data to uncover insights about:

  • Top imported and exported products by country
  • Trade patterns across different transportation modes
  • Comparative trade statistics (imports vs. exports) by country

📁 Project Structure

learnds/
├── README.md                          # This file
├── requirements.txt                   # Python dependencies
├── data/
│   └── processed_trade.csv           # Cleaned trade data
└── src/
    ├── analysis/
    │   ├── process_data.py           # Data cleaning and preprocessing
    │   ├── imported_products.py      # Analysis of top imported products
    │   ├── mode_distribution.py      # Trade by transportation mode analysis
    │   └── overall_trade.py          # Overall import/export comparison
    └── visualization/
        ├── imported_products.html    # Interactive product visualization
        ├── mode_distribution.html    # Transportation mode visualization
        └── overall_trade.html        # Trade comparison visualization

🔧 Prerequisites

  • Python 3.7+
  • pip or conda package manager

📦 Installation

  1. Clone or download this repository:
cd learnds
  1. Install required dependencies:
pip install -r requirements.txt

Dependencies

  • pandas - Data manipulation and analysis
  • bokeh - Interactive data visualization
  • squarify - Treemap visualization library

🚀 Usage

Step 1: Process the Data

Run the data cleaning script to prepare raw trade data:

python src/analysis/process_data.py

This script:

  • Loads raw trade data from data/trade.csv
  • Converts date formats
  • Filters data from 2024 onwards
  • Removes unnecessary columns
  • Cleans entries with zero values
  • Exports cleaned data to data/processed_trade.csv

Step 2: Run Analyses

Generate visualizations by running the analysis scripts:

Top Imported Products Analysis:

python src/analysis/imported_products.py

Outputs: src/visualization/imported_products.html

Transportation Mode Distribution:

python src/analysis/mode_distribution.py

Outputs: src/visualization/mode_distribution.html

Overall Trade Comparison:

python src/analysis/overall_trade.py

Outputs: src/visualization/overall_trade.html

Step 3: View Results

Open the generated HTML files in your web browser to explore interactive visualizations.

📈 Analysis Descriptions

Imported Products (imported_products.py)

Identifies and visualizes the top 20 most imported products across different countries. Creates an interactive chart showing:

  • Top 20 products by import value
  • Top 10 countries contributing to imports
  • Product codes and names with import values in millions

Mode Distribution (mode_distribution.py)

Analyzes trade patterns by transportation mode. Currently focuses on:

  • Top 10 products imported by the United States via Road transport
  • Pie chart visualization showing the distribution and percentage of each product

Overall Trade (overall_trade.py)

Provides a comparative view of imports and exports:

  • Grouped by country and trade type (Import/Export)
  • Side-by-side bar chart comparing import and export values
  • Interactive hover tooltips with formatted values

📊 Visualizations

All visualizations are created using Bokeh and saved as interactive HTML files. Features include:

  • Hover tooltips for detailed information
  • Interactive tools (zoom, pan, reset)
  • Responsive design
  • Professional styling

📝 Data Format

The processed trade data includes the following columns:

  • DATE - Reference date in MM/YYYY format
  • COUNTRY - Principal trading partner country
  • MODE - Mode of transport (Road, Air, Rail, etc.)
  • PRODUCT - Product classification with code
  • Trade - Trade type (Import/Export)
  • VALUE - Trade value or volume

🔍 Key Features

  • Automated Data Cleaning - Handles missing values, formatting, and filtering
  • Interactive Visualizations - Bokeh-based charts for data exploration
  • Scalable Analysis - Easy to extend with additional analyses
  • Clean Code Structure - Modular functions for data processing and visualization

💡 Future Enhancements

  • Add time series analysis of trade trends
  • Implement correlation analysis between products
  • Create dashboard combining multiple visualizations
  • Add forecasting models for future trade patterns
  • Include geographic mapping of trade data

📄 License

This project is available for educational purposes.

👨‍💻 Author

Data Science Learning Project

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors