[Python] How to get text from the span tag with itemprop="name" using BeautifulSoup?

Question 1

I want to extract some text from a webpage. The text is inside span tags with itemprop="name". How do I specify itemprop="name" in a BeautifulSoup call to get the values?

e.g.

<span itemprop="name">Stanford University</span>

Answer 1

You can pass itemprop="name" as a keyword argument to find_all() to find every span tag whose itemprop attribute is "name".

Here is an example:

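from bs4 import BeautifulSoup
import urllib.request as ur

# Placeholder: replace with the full URL of the page to scrape
url = "full_url_of_the_webpage"
# Send a browser-like User-Agent, since some sites reject the default one
req = ur.Request(url, None, headers={'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/35.0.1916.47 Safari/537.36'})
rs = ur.urlopen(req)

soup = BeautifulSoup(rs, 'html.parser')
# Find every <span> whose itemprop attribute equals "name"
for sp in soup.find_all('span', itemprop="name"):
    print(sp.text)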

To run this code, replace "full_url_of_the_webpage" with the actual URL of the page.
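
If you want to try the lookup without fetching a page, here is a minimal offline sketch. The HTML string is made up for illustration (only the Stanford University span comes from the question), and the variable names are arbitrary. It also shows two other ways to write the same search: the attrs dictionary and a CSS selector passed to select().

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; only the first span is taken from the question.
html = """
<div>
  <span itemprop="name">Stanford University</span>
  <span itemprop="address">Stanford, CA</span>
  <span itemprop="name">  Example College </span>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# Keyword-argument form, the same call used in the answer.
names_kw = [sp.get_text(strip=True)
            for sp in soup.find_all("span", itemprop="name")]

# Same search with an attrs dictionary; handy when the attribute name
# is not a valid Python identifier (for example "data-id").
names_attrs = [sp.get_text(strip=True)
               for sp in soup.find_all("span", attrs={"itemprop": "name"})]

# Same search written as a CSS attribute selector.
names_css = [sp.get_text(strip=True)
             for sp in soup.select('span[itemprop="name"]')]

print(names_kw)  # ['Stanford University', 'Example College']
```

get_text(strip=True) trims surrounding whitespace, which sp.text keeps as-is. If you expect only one match, find() returns the first matching tag, or None when nothing matches.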

pkumar81 · Answer 1 · 2022-09-08T21:14:23+0000
