About this project
it-programming / data-science-1
Open
We are seeking an experienced freelancer to design, develop, and implement an automated ETL (Extract, Transform, Load) data pipeline. The primary goal is to consolidate sales and customer data from disparate sources into a centralized data warehouse and visualize key performance indicators through an interactive Power BI dashboard. The project involves several key phases: Data Extraction: Collect raw sales and customer data from various sources, including CSV files and a MySQL database. Data Transformation: Clean, preprocess, and transform the extracted datasets using Python, specifically leveraging the Pandas library for efficient data manipulation. This includes handling missing values, data type conversions, and structuring data for analysis. Data Loading: Load the cleaned and transformed data into a robust and scalable data warehouse solution. Data Visualization: Connect the structured data from the data warehouse to Power BI to create a comprehensive and interactive dashboard. The dashboard should feature essential KPIs related to sales performance, customer trends, and other relevant business metrics to provide actionable insights. The ideal candidate will have strong expertise in data engineering, database management, Python programming for data processing, and Power BI dashboard development. The ability to create a reliable and automated pipeline is crucial for this project.
Category IT & Programming
Subcategory Data Science
Project size Large
Project duration Not specified
Skills needed