ladybug

Reputation: 602

Get urls of a given site from queries

I'm trying to get URLs from a website based on keywords. I want to print only the first 10 results per query (to avoid a too-many-requests error).

import requests
from bs4 import BeautifulSoup

queries = ["ner", "spacy", "bert", "lda"]

for i in queries:
    reqs = requests.get("https://github.com/search?q=" + str(i))
    soup = BeautifulSoup(reqs.text, 'html.parser')

    for links in soup.select('a'):
        print(links.get('href'))

My output:

https://github.com/
/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2Fsearch&source=header
/features/actions
/features/packages
/features/security
/features/codespaces
/features/copilot
/features/code-review
/features/issues
/features/discussions
/features
https://docs.github.com
https://skills.github.com/

I was looking for a list of links that contain one of these words...
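Filtering the collected hrefs by keyword and capping the output at ten could be sketched like this (filter logic only, run here over a small hypothetical sample of the hrefs that `soup.select('a')` yields):

```python
from itertools import islice

def filter_links(hrefs, keyword, limit=10):
    """Keep hrefs containing the keyword (case-insensitive), up to `limit` matches."""
    matches = (h for h in hrefs if h and keyword.lower() in h.lower())
    return list(islice(matches, limit))

# hypothetical sample of hrefs as extracted from the search page
sample = ["https://github.com/", "/features/actions", "/shiyybua/NER", "/wavewangyue/ner", None]
print(filter_links(sample, "ner"))  # → ['/shiyybua/NER', '/wavewangyue/ner']
```

Using a generator with `islice` stops the scan as soon as `limit` matches are found, rather than filtering the whole list first.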

Upvotes: 0

Views: 30

Answers (1)

HedgeHog

Reputation: 25196

Assuming you only want the links of the results, simply take the first anchor from each list item by selecting more specifically:

for e in soup.select('.codesearch-results li'):
    print(e.a.get('href'))

Example

import requests
from bs4 import BeautifulSoup

queries = ["ner", "spacy", "bert", "lda"]

for i in queries:
    reqs = requests.get(f"https://github.com/search?q={i}")
    soup = BeautifulSoup(reqs.text, 'html.parser')

    for e in soup.select('.codesearch-results li'):
        print(e.a.get('href'))

Output

/shiyybua/NER
/ryanoasis/nerd-fonts
/preservim/nerdtree
/bmild/nerf
/wavewangyue/ner
/synalp/NER
/preservim/nerdcommenter
/containerd/nerdctl
/NervJS/nerv
/deeppavlov/ner
/explosion/spaCy
/explosion/spacy-course
/explosion/spacy-models
/explosion/spacy-transformers
/chartbeat-labs/textacy
/susanli2016/NLP-with-Python
/explosion/spacy-streamlit
...
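Since the question also asks for only the first ten results per query, slicing the selected list is one simple option. A minimal, self-contained sketch (using a stand-in HTML snippet with the same `.codesearch-results li` structure, since the live page markup may change):

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for a search results page with 15 result items
html = "<div class='codesearch-results'><ul>" + "".join(
    f"<li><a href='/repo/{n}'>repo {n}</a></li>" for n in range(15)
) + "</ul></div>"
soup = BeautifulSoup(html, "html.parser")

# [:10] keeps only the first ten matched <li> elements
first_ten = [e.a.get("href") for e in soup.select(".codesearch-results li")[:10]]
print(first_ten)
```

Applied to the loop above, that would be `for e in soup.select('.codesearch-results li')[:10]:`.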

Upvotes: 1
