Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Puter-Apps/scraper

Open more actions menu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A simple web-based scraping tool that allows you to extract data from websites using CSS selectors.

Screenshot


Getting Started

Clone the repository:

git clone https://github.com/puter-apps/scraper.git

and open the /src/index.html file in your browser.


How It Works

Web Scraper leverages Puter.js to overcome the fundamental challenge of cross-origin requests in web browsers. Traditional web applications are restricted by CORS (Cross-Origin Resource Sharing) policies, which prevent direct requests to external domains from browser-based JavaScript.

Specifically, Web Scraper uses puter.net.fetch() to make cross-origin HTTP requests and bypass CORS restrictions without needing a proxy server. This allows the app to scrape public websites without server-side configuration.

The scraped HTML is then parsed using the browser's built-in DOMParser API, and data is extracted using standard CSS selectors via querySelectorAll().


License

MIT

About

A simple app to scrape data from websites using CSS selectors.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
Morty Proxy This is a proxified and sanitized view of the page, visit original site.