This repo shows how to build an AI-powered web scraper using Browser Use, a Python framework that lets a large language model (LLM) control your real browser.
It scrapes dynamic websites like Twitter (X), including pages that rely on JavaScript or require login sessions, and returns structured data with minimal code.
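At its core, the scraper is only a few lines of Python: you describe the job in plain language and the agent plans the browser actions itself. A minimal sketch (the task text is illustrative, and the Browser Use API has shifted between versions, so check the one you install):

```python
import asyncio

from browser_use import Agent
from langchain_openai import ChatOpenAI

async def main():
    # The LLM decides which browser actions (open, scroll, click, extract)
    # to take in order to complete the natural-language task.
    agent = Agent(
        task="Open https://x.com/apify and summarize the 3 most recent posts.",
        llm=ChatOpenAI(model="gpt-4o"),
    )
    await agent.run()

asyncio.run(main())
```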
- Python 3.8+
- An LLM API key (the example uses OpenAI’s GPT-4o)
- Google Chrome installed
- Launches your actual Chrome browser (with your cookies & sessions)
- Uses GPT-4o to navigate the web and extract content
- Returns structured output using Pydantic models (see the sketch after this list)
- Scrapes Apify’s latest tweets as a working example
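The structured-output feature works by registering a Pydantic model with Browser Use’s `Controller`, which constrains the agent’s final answer to that schema. A rough sketch (the field names are illustrative, not the repo’s actual model):

```python
from browser_use import Agent, Controller
from langchain_openai import ChatOpenAI
from pydantic import BaseModel

class Post(BaseModel):
    # Illustrative fields; extend with whatever you ask the agent to extract.
    author: str
    text: str

class Posts(BaseModel):
    posts: list[Post]

# The controller forces the agent's final answer into the Posts schema.
controller = Controller(output_model=Posts)

agent = Agent(
    task="Extract the 3 most recent posts (author and text) from the page.",
    llm=ChatOpenAI(model="gpt-4o"),
    controller=controller,
)
```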
- Python 3.8+
- Browser Use
- Playwright
- OpenAI GPT-4o (requires an API key)
- Pydantic (for typed outputs)
```bash
git clone https://github.com/your-username/ai-browser-scraper.git
cd ai-browser-scraper
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
playwright install
```
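For reference, a `requirements.txt` for this stack might look like the following (the package set is an assumption based on the tech stack above; pin versions in a real project):

```
browser-use
playwright
langchain-openai
pydantic
python-dotenv
```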
- Create a `.env` file:

  ```bash
  touch .env
  ```

- Paste your OpenAI API key inside `.env`:

  ```
  OPENAI_API_KEY=sk-...
  ```

- (Optional) Update the Chrome path in `main.py` if you’re not on macOS.
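Inside `main.py`, the key is typically loaded with `python-dotenv`, and the real-Chrome launch is configured with the path to your Chrome binary. A sketch, assuming the `Browser`/`BrowserConfig` style of the browser-use API (newer releases rename these) and illustrative paths:

```python
from browser_use import Browser, BrowserConfig
from dotenv import load_dotenv

load_dotenv()  # reads OPENAI_API_KEY from .env into the environment

# Point Browser Use at your installed Chrome so it reuses your profile,
# cookies, and login sessions. Typical binary locations:
#   macOS:   /Applications/Google Chrome.app/Contents/MacOS/Google Chrome
#   Windows: C:\Program Files\Google\Chrome\Application\chrome.exe
#   Linux:   /usr/bin/google-chrome
browser = Browser(
    config=BrowserConfig(
        chrome_instance_path="/Applications/Google Chrome.app/Contents/MacOS/Google Chrome",
    )
)
```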
```bash
python main.py
```
You’ll see:

- A Chrome browser window open on Apify’s X page
- GPT-4o navigating and extracting data from the 3 most recent posts
- Clean output saved to a JSON file
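Putting it together, the run step might look like the sketch below: `initial_actions` opens the profile before the LLM takes over, and the final answer is validated against the Pydantic model and written to disk. (`posts.json` and the task wording are assumptions; `Posts`, `controller`, and `browser` come from the earlier sketches.)

```python
import asyncio
import json

from browser_use import Agent
from langchain_openai import ChatOpenAI

async def main():
    agent = Agent(
        task="Extract the 3 most recent posts: author and text.",
        llm=ChatOpenAI(model="gpt-4o"),
        browser=browser,        # real-Chrome config from the setup sketch
        controller=controller,  # Controller(output_model=Posts)
        # Open the target page before the agent starts reasoning.
        initial_actions=[{"open_tab": {"url": "https://x.com/apify"}}],
    )
    history = await agent.run()

    # final_result() returns the agent's answer as a JSON string.
    result = Posts.model_validate_json(history.final_result())
    with open("posts.json", "w") as f:
        json.dump(result.model_dump(), f, indent=2)

asyncio.run(main())
```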
- Want to scrape Instagram, Amazon, or Reddit instead? Just change the `initial_actions` URL and update the prompt.
- Want to run it for multiple profiles? Wrap the task in a loop and switch URLs dynamically (see the sketch after this list).
- Want to extract more data (likes, timestamps)? Update the Pydantic model and your task prompt accordingly.
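For instance, a multi-profile run is the same agent built once per URL, and extra fields only need a bigger model plus a prompt that mentions them. A sketch with hypothetical profile handles:

```python
import asyncio

from browser_use import Agent, Controller
from langchain_openai import ChatOpenAI
from pydantic import BaseModel

class Post(BaseModel):
    author: str
    text: str
    likes: int       # new field: remember to ask for it in the prompt
    timestamp: str   # new field: likewise

class Posts(BaseModel):
    posts: list[Post]

async def scrape_all():
    for handle in ["apify", "openai"]:  # hypothetical profile handles
        agent = Agent(
            task=f"Extract the 3 most recent posts from @{handle}: "
                 "author, text, likes, and timestamp.",
            llm=ChatOpenAI(model="gpt-4o"),
            controller=Controller(output_model=Posts),
            initial_actions=[{"open_tab": {"url": f"https://x.com/{handle}"}}],
        )
        await agent.run()

asyncio.run(scrape_all())
```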