
Extracting all URLs on a Web Page with Chrome Developer Tools
Posted on October 15, 2024 in Google Chrome, JavaScript by Matt Jennings

Updated (October 15, 2021)

Someone in the comments asked how they can return only URLs containing “abc” or “defg”. The information below covers filtering links by text in the URL, along with how to make your code compatible with older browsers if you want to use this JavaScript snippet on a website.

In Chrome go to a website you want to extract links from, like https://stackexchange.com/.
Follow steps 1 through 3 under the Original Information section below.

Inside the Console panel paste the JavaScript below and press Enter:

const links = document.getElementsByTagName('a');

for (const link of links) {
    console.log(link.getAttribute('href'));
}

Or, if on https://wordpress.org/ you want to find all links that contain specific text (like “showcase”) in the URL, use the code below, which will also work on very old browsers (like Internet Explorer 9 and above):

var links = document.getElementsByTagName('a');

for (var i = 0; i < links.length; i++) {
  var href = links[i].getAttribute('href');
  
  if (href && href.indexOf('showcase') > -1) {
    console.log(href);
  }
}

Or, if you want modern code that works in Google Chrome but not in very old browsers (it will not work in Internet Explorer at all), use the code below to find all links on https://wordpress.org/ that contain specific text (like “showcase”) in the URL:
```
const links = document.getElementsByTagName('a');

[...links].forEach((link) => {
    const url = link.getAttribute('href');
    const isShowcaseLink = url?.includes('showcase');
    if(isShowcaseLink) {
        console.log(url);
    }
});
```
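
To match more than one piece of text (for example, URLs containing “abc” or “defg”), one option is to check each URL against a list of keywords. The snippet below is only a sketch: filterHrefs and the sample data are names I made up for illustration, and the part that reads from the page only runs where document exists (like the Chrome console).

```javascript
// Hypothetical helper: returns the hrefs that contain at least one keyword.
// `hrefs` is an array of strings (or null for links without an href attribute).
function filterHrefs(hrefs, keywords) {
    return hrefs.filter((href) =>
        typeof href === 'string' &&
        keywords.some((keyword) => href.includes(keyword))
    );
}

// Sample data standing in for a real page (made up for illustration).
const sampleHrefs = ['/abc/page', null, 'https://example.com/defg', '/other'];
const matches = filterHrefs(sampleHrefs, ['abc', 'defg']);
console.log(matches);

// In the Chrome console, the same helper can be fed from the page itself.
if (typeof document !== 'undefined') {
    const pageHrefs = [...document.getElementsByTagName('a')]
        .map((link) => link.getAttribute('href'));
    filterHrefs(pageHrefs, ['abc', 'defg']).forEach((href) => console.log(href));
}
```

The typeof check skips links that have no href attribute, which avoids the “Cannot read property 'includes' of null” error mentioned in the comments.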

Original Information

Thank you to Shan Eapen Koshy for posting a YouTube video on how to do this.

In Chrome, go to the website that you want to extract links from, like https://stackexchange.com/.
Open Chrome Developer Tools by pressing Cmd + Opt + i (Mac) or F12 (Windows).
Click the Console panel near the top of Chrome Developer Tools.
Inside the Console panel paste the JavaScript below and press Enter:
```
var urls = document.getElementsByTagName('a');

// Loop by index; for...in would also visit non-link properties such as length.
for (var i = 0; i < urls.length; i++) {
    console.log(urls[i].href);
}
```
Now you will see all the links from that particular web page.
You can also click the Undock into a separate window button (in the upper-right of Chrome Developer Tools and just left of the X that you can click to close Chrome Developer Tools). This will open a separate window that only displays Chrome Developer Tools along with the extracted links.
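
One thing to keep in mind: link.href gives the full (absolute) URL, while link.getAttribute('href') gives the value exactly as written in the HTML, which may be relative (like /questions). If you collect raw href values and want full URLs without duplicates, something like the sketch below may help. toUniqueAbsoluteUrls and the sample values are hypothetical names and data I chose for illustration, and the page-reading part only runs where document exists.

```javascript
// Hypothetical helper: resolves raw href values against a base URL
// and removes duplicates.
function toUniqueAbsoluteUrls(hrefs, baseUrl) {
    const seen = new Set();
    for (const href of hrefs) {
        if (typeof href !== 'string') continue; // skip links without an href
        try {
            seen.add(new URL(href, baseUrl).href);
        } catch (error) {
            // Skip values that cannot be parsed as a URL.
        }
    }
    return [...seen];
}

// Sample data standing in for a real page (made up for illustration).
const sample = ['/questions', 'https://stackexchange.com/questions', null, 'sites'];
const absolute = toUniqueAbsoluteUrls(sample, 'https://stackexchange.com/');
console.log(absolute.join('\n'));

// In the Chrome console, resolve the page's own links against its base URL.
if (typeof document !== 'undefined') {
    const pageHrefs = [...document.getElementsByTagName('a')]
        .map((link) => link.getAttribute('href'));
    console.log(toUniqueAbsoluteUrls(pageHrefs, document.baseURI).join('\n'));
}
```

Joining the results with a newline prints one URL per line, which is easier to copy into a text file than separate console entries.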

16 Responses

Shwetha says:

December 13, 2018 at 21:37

But when the same code is written for chrome extension it gives “undefined” as the result

- Matt Jennings says:
  
  December 14, 2018 at 15:26
  
  I updated the code Shwetha. Thanks for reading my blog!
  
William Pate says:

February 16, 2020 at 15:07

I love your code block in this post. Thus, I’m probably gonna steal it. 🙂

- Matt Jennings says:
  
  March 2, 2020 at 15:33
  
  Go for it William Pate!
  
Calvin Gooley says:

January 5, 2021 at 05:34

Hi,

This is awesome. How would I return only certain URLs?

For example, I want to return only URLs containing “abc” or “defg”.

- Matt Jennings says:
  
  April 1, 2021 at 04:03
  
  Hi Calvin,
  
  See my answer under the “Updated (April 1, 2021)” heading above. That includes the information you need.
  
Sebastian says:

April 2, 2021 at 14:37

Great code, however I have a problem with it. I want to list all e-mail addresses from a website, but after replacing “showcase” to “mailto:” I’m getting an error: “Uncaught TypeError: Cannot read property ‘includes’ of null
at :3:34”. Is there a way to make it work?

- Sebastian says:
  
  April 2, 2021 at 14:42
  
  OK, YouTube comment section under the original video solved it for me 😉 Below code works just fine:
  
  filteredString = 'mailto:';
  urls = $$('a'); for (url in urls) if (urls[url].href.toLowerCase().includes(filteredString)) console.log ( urls[url].href );
  
  - Matt Jennings says:
    
    April 3, 2021 at 09:32
    
    I assume you are using jQuery Sebastian when you rewrote:
    urls = $$('a');
    
    You will just need to remove one “$” character in the line above so it looks like:
    urls = $('a');
    
    - Uncool Coffee says:
      
      October 11, 2024 at 23:13
      
      The double `$$` is ok in Chrome’s developer console. It is the “querySelectorAll” selector and the double $ vs single means it will return an array.
      
      A refactoring of this code might look like this:
      
      ```
      console.log(Array.from($$("a"))
      .filter(url => url.href.toLowerCase().includes("/unique-string-in-target-urls/"))
      .map(url => url.href)
      .join(" "));
      ```
      
      - Matt Jennings says:
        
        November 5, 2024 at 19:12
        
        @Uncool Coffee
        Your code would work only in the Chrome Developer Tools console. However, I wouldn’t recommend using $$ (like $$('a')). I would instead use document.querySelectorAll('a'), as this JavaScript works in all browsers and isn’t limited to Chrome.
Praveen says:

May 31, 2021 at 19:15

Hi Matt,

Thanks for this code. I am trying to extract all the requests (document requests, XHR requests, and resource requests), along with the start time, complete time, and load time for each request.
Please let me know, if I can achieve this.

Praveen T

- Matt Jennings says:
  
  June 1, 2021 at 11:50
  
  Hi Praveen,
  
  Unfortunately I don’t know how to do this. Good luck with a Google search on how to do this.
  
Tamil says:

February 7, 2022 at 09:56

Is there a way to change the URL in the background? For example:

link from webpage
https://www[dot]facebook[dot]com

need to convert with the following format
https://example[dot]com?url=https://facebook%5Bdot%5Dcom

Kevin says:

May 27, 2022 at 11:44

i know this is old but it was 1 in google serp lol

hey how would i extract links say from tik tok comments

from only the comment div block

Drew says:

July 6, 2022 at 13:04

Thank you so much for this code. You saved me a ton of time. A website linked 50+ pdf files individually. Your code, along with another fella from NOAA, helped me avoid right clicking each one to download. Instead a simple scrape of the pdf links, and a script through command prompt to download the list of links- voila! 50 pdfs downloaded in a matter of seconds.- using stock Windows nonetheless!

Thank you again!!!!

