“Python Web Scraping” Ответ

Python Web Scraping

#Scrapes Python's URL, version number and logo from its Wikipedia page:

# $ pip3 install requests beautifulsoup4
import requests, bs4, os, sys

WIKI_URL = 'https://en.wikipedia.org/wiki/Python_(programming_language)'
try:
    html       = requests.get(WIKI_URL).text
    document   = bs4.BeautifulSoup(html, 'html.parser')
    table      = document.find('table', class_='infobox vevent')
    python_url = table.find('th', text='Website').next_sibling.a['href']
    version    = table.find('th', text='Stable release').next_sibling.strings.__next__()
    logo_url   = table.find('img')['src']
    logo       = requests.get(f'https:{logo_url}').content
    filename   = os.path.basename(logo_url)
    with open(filename, 'wb') as file:
        file.write(logo)
    print(f'{python_url}, {version}, file://{os.path.abspath(filename)}')
except requests.exceptions.ConnectionError:
    print("You've got problems with connection.", file=sys.stderr)
Michael Kuksin

Python Web Scraping

$ python -m pip install requests
Misty Mink

Python Web Scraping

	#Run request
	url = "https://nanolooker.com/account/" + address
    response = requests.get(url)

    if response.ok:
        # Make some soup
    	soup = BeautifulSoup(response.text, "lxml")

        #Show the title
    	print("The title is: " + str(soup.title.string))
Supreme Oreo

Скрипт Python для сокраски данных с веб -сайта

import os
import requests
from bs4 import BeautifulSoup
url = "https://www.google.com/"
reponse = requests.get(url)
if reponse.ok:
 soup = BeautifulSoup(reponse.text, "lxml")
 title = str(soup.find("title"))
 title = title.replace("<title>", "")
 title = title.replace("</title>", "")
 print("The title is : " + str(title))
os.system("pause")
Harendra

Сскабливание в Интернете с использованием кода Python

>>> from bs4 import BeautifulSoup
>>> raw_html = open('contrived.html').read()
>>> html = BeautifulSoup(raw_html, 'html.parser')
>>> for p in html.select('p'):
...     if p['id'] == 'walrus':
...         print(p.text)

'I am the walrus'


import requests
from bs4 import BeautifulSoup

URL = 'https://www.monster.com/jobs/search/?q=Software-Developer&where=Australia'
page = requests.get(URL)

soup = BeautifulSoup(page.content, 'html.parser')
Blue-eyed Bug

Как сделать паутинг Python

Web Scraping  - Python
Tender Tuatara

Ответы похожие на “Python Web Scraping”

Вопросы похожие на “Python Web Scraping”

Больше похожих ответов на “Python Web Scraping” по TypeScript

Смотреть популярные ответы по языку

Смотреть другие языки программирования