User Tools

Site Tools


projects:digikey_partsdb

This is an old revision of the document!


digikey parts slurper

fetch www.digikey.com/product-search/en?FV=

grep for catfilterlink

remove beginning of line to inclusive

remove end of line from inclusive

produces following info
grabbing FV's

we need the FV's to crawl each subsection.

  • <input type=hidden name=FV value=fff40000,fff80000>

also grab the total page count

  • <a class=“Last” href=”/product-search/en/undefined-category/undefined-family/0/page/8”>Last</a>

The page/8 is the total page count, pages start from 1

grab the FV value and page count, and store for each of the above URL's

crawl individual pages
projects/digikey_partsdb.1381592654.txt.gz · Last modified: 2013/10/12 08:44 by charliex