Web Scraper

A simple web-based scraping tool that allows you to extract data from websites using CSS selectors.

Getting Started

Clone the repository:

git clone https://github.com/puter-apps/scraper.git

and open the /src/index.html file in your browser.

How It Works

Web Scraper leverages Puter.js to overcome the fundamental challenge of cross-origin requests in web browsers. Traditional web applications are restricted by CORS (Cross-Origin Resource Sharing) policies, which prevent direct requests to external domains from browser-based JavaScript.

Specifically, Web Scraper uses puter.net.fetch() to make cross-origin HTTP requests and bypass CORS restrictions without needing a proxy server. This allows the app to scrape public websites without server-side configuration.

The scraped HTML is then parsed using the browser's built-in DOMParser API, and data is extracted using standard CSS selectors via querySelectorAll().

License

MIT

Name	Name	Last commit message	Last commit date
Latest commit History 7 Commits
src	src
.gitattributes	.gitattributes
LICENSE	LICENSE
README.md	README.md
screenshot.png	screenshot.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Web Scraper

Getting Started

How It Works

License

About

Uh oh!

Releases

Packages

Languages

Search code, repositories, users, issues, pull requests...

License

Puter-Apps/scraper

Folders and files

Latest commit

History

Repository files navigation

Web Scraper

Getting Started

How It Works

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages