How to extract a part of the content of a Web page

Takeru · December 7, 2023, 7:28am

I would like to know how to extract part of the content of a web page.
I have a workflow to get the Feed and summarize it in ChatGPT.
I am able to get the full text of the body with “Send HTTP reqest”, but I want to extract only the elements in which the div is main.

Dennis · December 7, 2023, 7:37am

Takeru · December 7, 2023, 7:40am

I did not quite understand the reference.
I consulted with AI and they output the code below, do I just paste this somewhere?

import requests
from lxml import html

response = requests.get("URL")
tree = html.fromstring(response.content)

main_div = tree.xpath('//div[@id="main"]')[0]
print(main_div.text_content())

Takeru · December 13, 2023, 7:43am

Can someone tell me if anyone can figure this out?

Abdul · December 26, 2023, 9:08pm

Hello @Takeru

sorry for the late response, but could you please explain further what you mean by
but I want to extract only the elements in which the div is main

system · January 10, 2024, 9:09pm

This topic was automatically closed 15 days after the last reply. New replies are no longer allowed.