Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Latest commit

 

History

History
History
28 lines (24 loc) · 1.4 KB

File metadata and controls

28 lines (24 loc) · 1.4 KB
Copy raw file
Download raw file
Outline
Edit and raw actions

Wikipedia Population Table Data Extraction

This project demonstrates web scraping using Selenium WebDriver to extract data from an HTML table on a Wikipedia page. Specifically, it retrieves the list of countries and territories by total population, presenting a real-world example of data extraction and automation using Selenium.

Features

  • Automates navigation to a Wikipedia page on country populations.
  • Extracts data from an HTML table, including:
    • Location (Country or territory)
    • Population
    • Percentage of world population
    • Date of population data
    • Source of the data
    • Notes
  • Processes table rows dynamically to handle updates to the table structure or content.
  • Uses JavaScript for smooth scrolling to the target table.

Prerequisites

Ensure you have the following before running the project:

  1. Java Development Kit (JDK) - Version 8 or above.
  2. Google Chrome - Latest stable version.
  3. ChromeDriver - Version compatible with your Chrome browser.
  4. Selenium WebDriver - Included in the project dependencies.

Technologies Used

  • Java - The programming language for the project.
  • Selenium WebDriver - For web element interaction and automation.
  • Google Chrome & ChromeDriver - For browser-based automation.
  • JavaScript Executor - For advanced browser interactions like scrolling.
Morty Proxy This is a proxified and sanitized view of the page, visit original site.