diff --git a/README.md b/README.md index 75c6f2e..7d55b7c 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,23 @@ -# Dataset of LinkedIn members who work for ICE +# LinkedIn members who work for ICE Browse it here: https://antiboredom.github.io/ice-linkedin/ + +Please note that this script requires a LinkedIn account with a premium membership + +In order to run the code, first clone the repository, then install requirements: + +``` +pip install -r requirements.txt +``` + +Next, you'll need to edit `header.py` with info about your linkedin account. This is a bit tedious. + +1. Log in to your linkedin account in Chrome +2. Go to the sales navigator page for ICE: https://www.linkedin.com/sales/search?pivotType=EMPLOYEE&pivotId=533534 +3. Open your browser's development tools, and then click on the `network` tab, and then select `XHR` +4. Scroll to the bottom of the page and click the `Next` button +5. You should see a url appear on the network tab. Right click and select `Copy as cURL` +6. Open up https://curl.trillworks.com/ and paste into the `curl command` area +7. Copy everything in between `headers = {}` into `header.py` + +Run the script with `python linked_in_scraper.py`