Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

chpmem/bgg_pull

Open more actions menu
 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bgg_pull

This module supports grabbing information from the board game geek website (https://boardgamegeek.com/), output is a csv file. I couldn't find a way to get games by rank through the api, so I scrape the game ranks then use the api to get more details on each game. The frist step is scraping, once the first 5K games are grabbed and saved out, user can call get from api to fill out all details

  • call ScrapeRanks, this should take a while (~10 min) and gets the urls and rank of all games from the site
  • call GetFromApi, this should also take a while (~10 min) and gets the remaining information through API calls

usage

From the command line use the following arguments, the output of this is a dataframe.

  • -s or --scrape to get the names and id of the top 5K games
  • -a or --api to get information about the top 5K games

About

This script uses a combination of web scraping and API use to get the top 5K board games from the website boardgamegeek.com

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%
Morty Proxy This is a proxified and sanitized view of the page, visit original site.