Skip to main content

Extract Text from Web Page

Workflow task type to extract text from a web page.

Inputs

  • html: The HTML content of the web page (required input).

Outputs

  • text: The extracted text from the web page (default output).

Example Usage

{
"name": "extract_text_from_web_page",
"type": "web.extract_text",
"inputs": {
"html": "<html><body><h1>Hello, World!</h1></body></html>"
}
}

Specification

{
"specification": {
"type": "web.extract_text",
"category": "web",
"description": "Extract text from an HTML document",
"inputs": [
{
"name": "html",
"type": "text",
"required": true,
"description": "The HTML content of the web page",
"example": "<html><body><h1>Hello, World!</h1></body></html>"
}
],
"outputs": [
{
"name": "text",
"type": "text",
"default": true,
"description": "The extracted text from the HTML document",
"example": "Hello, World!"
}
]
}
}