scrape ASIN from amazon URL using javascript

前端 未结 16 723
旧巷少年郎
旧巷少年郎 2021-01-30 11:42

Assuming I have an Amazon product URL like so

http://www.amazon.com/Kindle-Wireless-Reading-Display-Generation/dp/B0015T963C/ref=amb_link_86123711_2?pf_rd_m=ATVP         


        
16条回答
  •  再見小時候
    2021-01-30 12:27

    None of the above work in all cases. I have tried following urls to match with the examples above:

    http://www.amazon.com/Kindle-Wireless-Reading-Display-Generation/dp/B0015T963C
    http://www.amazon.com/dp/B0015T963C
    http://www.amazon.com/gp/product/B0015T963C
    http://www.amazon.com/gp/product/glance/B0015T963C
    
    https://www.amazon.de/gp/product/B00LGAQ7NW/ref=s9u_simh_gw_i1?ie=UTF8&pd_rd_i=B00LGAQ7NW&pd_rd_r=5GP2JGPPBAXXP8935Q61&pd_rd_w=gzhaa&pd_rd_wg=HBg7f&pf_rd_m=A3JWKAKR8XB7XF&pf_rd_s=&pf_rd_r=GA7GB6X6K6WMJC6WQ9RB&pf_rd_t=36701&pf_rd_p=c210947d-c955-4398-98aa-d1dc27e614f1&pf_rd_i=desktop
    
    https://www.amazon.de/Sawyer-Wasserfilter-Wasseraufbereitung-Outdoor-Filter/dp/B00FA2RLX2/ref=pd_sim_200_3?_encoding=UTF8&psc=1&refRID=NMR7SMXJAKC4B3MH0HTN
    
    https://www.amazon.de/Notverpflegung-Kg-Marine-wasserdicht-verpackt/dp/B01DFJTYSQ/ref=pd_sim_200_5?_encoding=UTF8&psc=1&refRID=7QM8MPC16XYBAZMJNMA4
    
    https://www.amazon.de/dp/B01N32MQOA?psc=1
    

    This is the best I could come up with: (?:[/dp/]|$)([A-Z0-9]{10}) Which will also select the prepending / in all cases. This can then be removed later on.

    You can test it on: http://regexr.com/3gk2s

提交回复
热议问题