Henry Nguyễn
Jan 18, 2023

--

Hi Renato, wish I could provide a better solution but scraping on LinkedIn isn't quite feasible at large scale. I tried the approach of only run the scrape script when ever a user makes request but at thousands concurrent users, it would always fail. I even tried putting them all in queues but it still failed for large number of user.

It is entirely possible as long as the script isn't running as scraper (I know, shocking!). I had a stupid idea that the script would run like a user, open the Linkedin Profiles, take a screenshot then have another OCR processor to extract the text, then another text processor (may be NLP) to put the texts in the right places. But that's just way too complicated.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Henry Nguyễn
Henry Nguyễn

Written by Henry Nguyễn

Product Manager @ LexisNexis Uk

No responses yet

Write a response