This project aims to analyze the top companies in India by leveraging various data processing and visualization tools. The analysis involves scraping data from the "Top Companies in India" Wikipedia page, storing and cleaning the data, and creating an interactive dashboard using Power BI.
-
Data Scraping
- Used Python libraries to scrape data from the Wikipedia page listing the top companies in India.
- Extracted information such as rank, company name, industry, revenue, and headquarters.
-
Data Storage
- Stored the scraped data in a Pandas DataFrame.
- Exported the DataFrame to a CSV file for further processing.
-
Data Import and Cleaning
- Imported the CSV file into Power BI.
- Cleaned and transformed the data to ensure accuracy and consistency.
-
Data Relationship and Analysis
- Established relationships between different data points.
- Conducted various analyses to gain insights into the top companies in India.
-
Dashboard Creation
- Created an interactive dashboard in Power BI.
- The dashboard provides visualizations and insights derived from the data.
- Python: For web scraping and data manipulation.
- Pandas: For data storage and exporting to CSV.
- Power BI: For data cleaning, relationship building, analysis, and dashboard creation.
This project provides a comprehensive analysis of the top companies in India, leveraging data scraping, data processing, and data visualization techniques. The interactive dashboard created in Power BI offers valuable insights into the performance and characteristics of these companies.
Feel free to explore the project and contribute to further improvements!