GitHub - medialab/SearchEnginesBookmarklet: Extract list of results from search engines pages as CSV with a bookmarklet directly within the browser

SearchEnginesBookmarklet

Harvesting lists of URLs, titles, dates and descriptions from a query on a search engine such as Google, DuckDuckGo, Baidu, Bing or Qwant is a recurrent need in digital methods, and one that is hard to automate because of those websites' restrictions on robots.

SearchEnginesBookmarklet is a low-tech solution to this need: it offers an easy way to collect these results directly from within your browser.

Install it in a few clicks from the following page: https://medialab.github.io/SearchEnginesBookmarklet/

It works as a small icon to drag and drop into your browser's bookmarks bar, allowing you to:

  • first switch from a regular search results page to one with up to 100 results per page, when the search engine allows it;
  • then download the page's results as a CSV file in one click, or store them in the browser's memory and move on to the next results page so that several pages can be downloaded at once.
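
To illustrate the second step, here is a minimal sketch of how results accumulated over several pages could be merged and serialized as CSV. It is not the bookmarklet's actual code: the column names, the `toCSV` and `mergeResults` helpers, and the choice to skip URLs already stored are all assumptions made for the example.

```javascript
// Hypothetical column names; the real bookmarklet may export different fields.
const COLUMNS = ["url", "title", "date", "description"];

// Quote a field when it contains a comma, a double quote or a line break,
// doubling any embedded double quotes (RFC 4180 style).
function escapeField(value) {
  const text = value == null ? "" : String(value);
  return /[",\r\n]/.test(text) ? '"' + text.replace(/"/g, '""') + '"' : text;
}

// Serialize an array of result objects as CSV text with a header row.
function toCSV(results) {
  const header = COLUMNS.join(",");
  const rows = results.map((r) => COLUMNS.map((c) => escapeField(r[c])).join(","));
  return [header, ...rows].join("\n");
}

// Append a new page of results to the stored ones.
// Assumption: a URL already stored is not added a second time.
function mergeResults(stored, page) {
  const seen = new Set(stored.map((r) => r.url));
  const merged = stored.slice();
  for (const r of page) {
    if (!seen.has(r.url)) {
      seen.add(r.url);
      merged.push(r);
    }
  }
  return merged;
}

// Usage: two results pages sharing one URL.
const firstPage = [
  { url: "https://example.org/a", title: 'Say "hi"', date: "2020-01-01", description: "First, result" },
];
const secondPage = [
  { url: "https://example.org/a", title: 'Say "hi"', date: "2020-01-01", description: "First, result" },
  { url: "https://example.org/b", title: "B", date: "", description: "Second result" },
];
console.log(toCSV(mergeResults(firstPage, secondPage)));
```

In a browser, the stored array could be kept between pages with a mechanism such as `sessionStorage`; which storage the bookmarklet actually uses is not stated here.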

Install a local version for development

# Install node's express dependency
npm install express

# Create an HTTPS key & certificate set
openssl genrsa -out key.pem
openssl req -new -key key.pem -out csr.pem
openssl x509 -req -days 9999 -in csr.pem -signkey key.pem -out cert.pem
rm csr.pem

# Run your local HTTPS server
node serve-https.js

# Edit SearchEnginesBookmarklet.js to comment out the second line and uncomment the third one

# Load the following page in your browser to accept the unsafe certificate first
https://localhost:4443/

# Then install your development version of the bookmarklet as usual by dragging and dropping the image from that page into your bookmarks bar
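
For readers unfamiliar with bookmarklets, the sketch below shows one common way such a link can load a script from a server, which is why the local page must be served over HTTPS and its certificate accepted first. This is only an assumption about the general technique, not a description of how SearchEnginesBookmarklet is built: the `makeBookmarklet` helper and the script path under `https://localhost:4443/` are hypothetical.

```javascript
// Build a "javascript:" URL that, when clicked from the bookmarks bar,
// injects a <script> tag pointing at scriptUrl into the current page.
function makeBookmarklet(scriptUrl) {
  const body =
    "(function(){" +
    "var s=document.createElement('script');" +
    "s.src=" + JSON.stringify(scriptUrl) + ";" +
    "document.body.appendChild(s);" +
    "})();";
  return "javascript:" + encodeURIComponent(body);
}

// Hypothetical path: assumes the local server exposes the script at this URL.
const localScriptUrl = "https://localhost:4443/SearchEnginesBookmarklet.js";
console.log(makeBookmarklet(localScriptUrl));
```

Browsers generally refuse to load a script over plain HTTP into an HTTPS search results page, so a development copy has to be served over HTTPS as well.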

Credits & License

Benjamin Ooghe-Tabanou, Julien Pontoire & al @ Sciences Po médialab

Discover more of our projects at médialab tools.

SearchEnginesBookmarklet is free, open-source software released under the GPL 3.0 license.