Overview
The Text Extraction node uses AI to extract text and metadata from different content sources such as PDFs, web pages, images, or audio files. It converts raw files or URLs into structured, readable text that can be used by other nodes in your workflow.Description
This node is designed to automatically pull text and metadata from a variety of input formats — including documents, websites, screenshots, or speech recordings. Depending on the selected extraction type, it can:- Read and extract text from PDF documents.
- Capture and process the textual content of a webpage.
- Identify and read text from images (OCR-based).
- Convert speech-to-text from audio files.
Input Parameters
The Text Extraction node accepts flat key-value inputs that determine the source and method of text extraction.-
extractionType (string, required)
Defines the type of extraction to perform. Supported values:
"pdf-extraction"– Extracts text from PDF documents."web-extraction"– Extracts text and metadata from a webpage."image-extraction"– Extracts text from images using OCR (Optical Character Recognition)."audio-extraction"– Converts spoken content from audio files into text.
-
webUrl (string, required for web-extraction)
The URL of the web page to extract content from.
Example:
-
attachments (string, required for pdf/image/audio extraction)
Comma-separated list of file IDs or variable references to uploaded files.
Each file represents one source of extraction.
Example:
or
Output Parameters
After execution, the Text Extraction node returns the extracted text along with processing information and detailed metadata.- processingCount Number of files, pages, or segments processed during extraction.
- processingTime Total time taken for extraction, returned in ISO timestamp format.
- processingId A unique identifier assigned to this specific extraction request.
- content The main text extracted from the provided source. For audio extraction, this will contain the full transcription.
- markdown A markdown-formatted version of the extracted text for easy readability and formatting.
-
linksOnPage[]
A list of links found on the page (only applicable for
web-extraction). - metadata.title The page or document title.
- metadata.keywords Extracted keywords or tags found in metadata.
- metadata.description Short description or summary found in the document or webpage metadata.
-
metadata.robots
Robot meta instructions (e.g.,
index,follow). - metadata.ogTitle Open Graph title of the page.
- metadata.ogDescription Open Graph description of the page.
- metadata.ogImage URL of the Open Graph image associated with the webpage.
- metadata.ogSiteName The name of the website where content was extracted.
-
metadata.screenshot
Screenshot image reference captured during extraction (for
web-extraction). - speaker_labels[].speaker For audio extractions — identifies the speaker label (e.g., Speaker 1, Speaker 2).
- speaker_labels[].text Text spoken by each identified speaker segment.
- timestamps[].startTime Timestamp indicating when a specific word or phrase began in the audio file.
- timestamps[].endTime Timestamp indicating when a specific word or phrase ended.
- timestamps[].word Word or phrase detected during speech-to-text transcription.
- timestamps[].duration (string) Duration of the spoken segment.
Output Type
The output type must always be exactly:Example Usage
Example 1: Extract Text from a PDF
Example 2: Extract Text from Image
Example 3: Extract Text from Audio
How to Use in a No-Code Workflow
- Add the Text Extraction Node Drag and drop the node into your workflow.
-
Select Extraction Type
Choose the appropriate extraction type:
"pdf-extraction"for PDF files"web-extraction"for web pages"image-extraction"for scanned images or screenshots"audio-extraction"for speech-to-text conversion
-
Provide Input
- For PDFs, images, or audio: add file references under
attachments. - For webpages: enter the target URL in
webUrl.
- For PDFs, images, or audio: add file references under
- Run the Node The node will extract text, metadata, or transcription automatically.
-
Access Results
Use variable syntax to retrieve outputs such as:
- Connect to Next Nodes You can pass the extracted text to other nodes like Text Generation, AI Analysis, or Summarization for further automation.
Best Practices
- Ensure the correct extraction type is selected before execution.
- When extracting from multiple files, separate file IDs with commas.
- For web extractions, make sure the URL is publicly accessible.
- Use OCR-friendly images (clear text visibility) for best results.
- For large audio files, split them into smaller segments to improve performance.
- Always review extracted content for accuracy before using it in production workflows.
Example Workflow Integration
Use Case: Automatically summarize a PDF report.- Step 1: The File Upload Node provides the PDF file reference.
- Step 2: The Text Extraction Node extracts all text from the PDF.
- Step 3: The Text Generation Node summarizes the extracted content into key points.
- Step 4: The Email Node sends the summary to a user.
Common Errors
-
“Missing extractionType”
Cause: No extraction type was provided.
Solution: Always specify one of
"pdf-extraction","web-extraction","image-extraction", or"audio-extraction". -
“Missing webUrl”
Cause: The
webUrlfield is required forweb-extraction. Solution: Add a valid, publicly accessible URL. -
“Missing attachments”
Cause: No file references provided for file-based extraction.
Solution: Add valid file IDs or variable references in the
attachmentsfield. - “Unsupported file format” Cause: The file type is not supported for extraction. Solution: Use PDF, image, or audio files in standard formats.
- “Empty output” Cause: The AI was unable to extract content from the provided source. Solution: Recheck file quality, accessibility, or try a different extraction type.

