Workflow DevelopmentIntegration Nodes

Knowledge Base Node

AI-powered semantic search and document upload with metadata tagging and multi-collection support

The Knowledge Base node enables AI-powered semantic search and document upload capabilities in your workflows, integrating with your knowledge base collections to retrieve relevant information or index new documents with intelligent metadata tagging.

AI-Powered Knowledge Management

Semantic search and document upload with intelligent metadata and multi-collection support

2
Operations
Multi
Collection
6
Metadata Categories
8
File Formats

Overview

The Knowledge Base node is your gateway to intelligent document management within workflows. Whether you need to retrieve contextually relevant information using AI-powered semantic search or upload and index new documents with rich metadata tagging, this node provides enterprise-grade knowledge base integration.

Key Capabilities

Semantic Search

AI-powered document retrieval using natural language queries with relevance scoring and cross-collection search capabilities.

Document Upload

File upload with automatic chunking, embedding generation, and comprehensive metadata tagging for enhanced organization.

Collection Management

Multi-collection search support for comprehensive queries or single-collection upload for targeted document indexing.

Built-in Testing

Execute test searches with tabbed results display or test uploads with progress tracking before deploying workflows.

Use Cases

  • Technical Documentation Search: Find relevant API documentation, configuration guides, or troubleshooting steps using natural language queries
  • Compliance Document Upload: Index regulatory documents with metadata tags for GDPR, HIPAA, SOX, or other compliance frameworks
  • Cross-Collection Knowledge Retrieval: Search across multiple knowledge bases simultaneously to find comprehensive information
  • Automated Document Indexing: Upload documents from workflow triggers with automatic metadata extraction and tagging
  • Context-Aware Information Retrieval: Retrieve relevant information based on workflow context for dynamic decision-making
  • Regulatory Content Organization: Organize and retrieve compliance-related documents using metadata-based filtering

The Semantic Search operation enables AI-powered document retrieval using natural language queries with advanced filtering and relevance scoring.

1. Operation Selection

Select Semantic Search from the Operation dropdown to configure the node for document retrieval.

2. Collection Selection

Choose one or more collections to search across. The node supports multi-collection search, allowing you to query multiple knowledge bases simultaneously for comprehensive results.

  • Multi-Select: Select 2 or more collections for cross-collection search
  • Collection Details: View document count and chunk count for each collection
  • Status Indicator: Shows "Fully Configured" when all settings are complete

3. Query Configuration

Enter your search query using natural language. The AI-powered semantic search understands context and intent, not just keywords.

Query Input:

  • Natural language text (e.g., "How do I configure authentication in workflows?")
  • Supports expression binding from upstream workflow variables
  • Multi-line text area for complex queries

Example Queries:

What are the GDPR compliance requirements for data processing?
How do I implement OAuth 2.0 authentication?
What are best practices for error handling in API integrations?

4. Search Parameters

Fine-tune your search results using adjustable parameters:

Maximum Results (1-50, default: 10)

  • Controls the number of search results returned
  • Higher values provide more comprehensive results
  • Lower values focus on top matches

Similarity Threshold (0.0-1.0, default: 0.7)

  • Minimum confidence score for including results
  • Higher thresholds (0.8-1.0) return only highly relevant matches
  • Lower thresholds (0.5-0.7) return broader, more exploratory results

5. Advanced Metadata Filters (Optional)

Narrow search results by applying metadata filters across six categories. Filters use OR logic within categories and AND logic between categories.

Available Filter Categories:

  • Schema Types
  • Regulation Types
  • Data Types
  • Operations
  • Workflow Complexity
  • Workflow Domains

See the Metadata System section below for detailed filter options.

Example Filter Combination:

  • Schema Type: API, Database
  • Domain: Security, Web Development
  • Result: Documents tagged with (API OR Database) AND (Security OR Web Development)

6. Document Selection (Optional)

Limit your search scope to specific documents within the selected collections for improved relevance and speed.

Features:

  • Filter documents by name and metadata tags
  • View document upload date, file size, and chunk count
  • Group documents by collection for easier navigation
  • Select All / Clear buttons for bulk operations

When to Use Document Selection

Narrowing to selected documents can improve relevance and search speed when you know which documents contain the information you need. Leave unselected to search across all documents in the chosen collections.

7. Test Execution

Execute a test search to preview results before deploying your workflow.

Test Results Display:

  • Summary Tab: AI-generated summary with confidence score and key points
  • Results Tab: Individual search results showing:
    • Relevance score / match percentage
    • Document content preview
    • File name and collection
    • Associated metadata tags
  • Execution time tracking
  • Error handling with clear messages
Knowledge Base Node Semantic Search Configuration - Multi-collection selection, query input, search parameters, and metadata filters

Configuration Guide: Upload Document

The Upload Document operation enables file upload with automatic chunking, embedding generation, and metadata tagging for enhanced organization.

1. Operation Selection

Select Upload Document from the Operation dropdown to configure the node for document indexing.

2. File Upload Zone

Upload files using drag-and-drop or click to select from your file system.

Supported File Formats:

FormatExtensionsMax SizeUse Case
PDF.pdf10MBTechnical documentation, reports
Text.txt10MBPlain text documents, logs
Word.doc, .docx10MBBusiness documents, guides
Markdown.md10MBTechnical documentation, README files
JSON.json10MBConfiguration files, API responses
CSV.csv10MBData tables, spreadsheets
Excel.xlsx10MBBusiness reports, data analysis

Upload Process:

  1. Select or drag file to upload zone
  2. File is immediately uploaded to object storage
  3. File reference displays name, size, and type
  4. File is ready for indexing on workflow execution

3. Collection Selection

Choose a single collection to upload the document to. Unlike Semantic Search, document upload is restricted to one collection per operation.

  • Single-Select: Choose one collection from the dropdown
  • Collection Details: View current document count and chunk count
  • Status Indicator: Shows configuration status

4. Metadata Tagging (Optional)

Add metadata tags to organize and enhance document searchability. Tags are applied during upload and can be used for filtering in future searches.

Available Metadata Categories:

  • Schema Types (Database, API, Data Model, Scientific, Business)
  • Regulation Types (GDPR, HIPAA, SOX, PCI DSS, CCPA, etc.)
  • Data Types (Personal Data, Medical Records, Financial Data, etc.)
  • Operations (Processing, Storage, Transmission, Collection, etc.)
  • Workflow Complexity (Low, Medium, High, Enterprise)
  • Workflow Domains (Healthcare, Finance, E-Commerce, Education, etc.)

All metadata fields are optional. See the Metadata System section for complete details.

Metadata Best Practices

Adding metadata during upload improves document discoverability in future searches. Consider your organization's taxonomy and how documents will be searched when selecting tags.

5. Test Execution

Execute a test upload to verify file processing before deploying your workflow.

Test Upload Display:

  • Progress indicator (0-100%)
  • Results include:
    • Document ID
    • File hash
    • Chunks created count
    • Processing time (in seconds)
  • Success/error messages with clear feedback
Knowledge Base Node Upload Document - File upload zone, collection selection, metadata tagging, and test execution

Metadata System

The Knowledge Base node uses a comprehensive metadata system for both filtering searches and tagging uploads. Metadata is organized into six categories with predefined options.

Metadata Categories

CategoryOptionsUse Case
Schema TypesDatabase, API, Data Model, Scientific, BusinessCategorize technical documentation types
Regulation TypesGDPR, HIPAA, SOX, PCI DSS, CCPA, ISO 27001, FDA, FERPA, COPPAIdentify compliance frameworks and regulations
Data TypesPersonal Data, Medical Records, Financial Data, Payment Data, Educational Records, Biometric Data, Location Data, Behavioral DataClassify data sensitivity and privacy requirements
OperationsProcessing, Storage, Transmission, Collection, Deletion, Reporting, Access, ModificationDefine data handling operations covered
Workflow ComplexityLow, Medium, High, Enterprise (single-select)Indicate complexity level of workflows
Workflow DomainsHealthcare, Finance, E-Commerce, Education, Manufacturing, Logistics, Research, Web Development, Data Science, CybersecuritySpecify industry or functional domain

Filter Logic

When using metadata filters in searches:

  • Within Category: OR logic (e.g., selecting "API" and "Database" returns documents with either tag)
  • Between Categories: AND logic (e.g., selecting "API" in Schema Types and "Security" in Domains returns documents with both tags)

Metadata and Search Quality

Adding comprehensive metadata during upload significantly improves search quality:

  • Better Relevance: Metadata helps the AI understand document context
  • Faster Filtering: Pre-tagged documents enable quick filtering
  • Compliance: Regulatory tags help manage compliance requirements
  • Organization: Domains and complexity tags improve knowledge organization

Use Case Examples

Scenario: A workflow needs to find authentication configuration steps from technical documentation.

Configuration:

Operation: Semantic Search
Collections: Technical Docs, API Reference
Query: "How do I configure OAuth 2.0 authentication in REST APIs?"
Max Results: 10
Similarity Threshold: 0.75
Filters:
  - Schema Type: API
  - Domain: Web Development, Security

Expected Output:

  • Relevant documentation sections about OAuth 2.0
  • Configuration examples and code snippets
  • Security best practices
  • Relevance scores showing match confidence

Workflow Use: Retrieved information can be used to dynamically configure authentication in subsequent API calls or provide user guidance.

Example 2: Compliance Document Upload

Scenario: Automatically upload and index GDPR compliance documents with proper metadata.

Configuration:

Operation: Upload Document
File: GDPR_Data_Processing_Policy.pdf
Collection: Compliance Documents
Metadata:
  - Regulation Type: GDPR
  - Data Type: Personal Data
  - Operations: Processing, Storage, Deletion
  - Domain: Finance
  - Complexity: Enterprise

Result:

  • Document uploaded and chunked into searchable segments
  • Metadata tags applied for future filtering
  • Document indexed in Compliance Documents collection
  • Available for semantic search with GDPR filter

Workflow Use: Compliance documents can be automatically uploaded from external sources (email, cloud storage) and indexed with consistent metadata.


Best Practices

Collection Organization

  • Separate by Domain: Create separate collections for different domains (e.g., Technical, Legal, Business)
  • Multi-Collection Search: Use multi-collection search for comprehensive queries across domains
  • Single-Purpose Upload: Upload to specific collections to maintain organization

Metadata Tagging Guidelines

  • Be Consistent: Use the same metadata scheme across similar documents
  • Tag Generously: Add multiple relevant tags to improve discoverability
  • Review Periodically: Audit metadata tags to ensure they remain accurate and useful
  • Use All Categories: Don't limit yourself to one or two categories

Search Query Optimization

  • Use Natural Language: Write queries as questions or statements, not just keywords
  • Be Specific: More specific queries return more relevant results
  • Adjust Threshold: Start with 0.7 similarity threshold and adjust based on result quality
  • Limit Results: Use fewer max results (5-15) for focused queries

File Preparation Tips

  • Clean Text: Ensure documents have clean, readable text (especially for PDFs)
  • Appropriate Size: Keep files under 10MB for optimal processing
  • Meaningful Names: Use descriptive filenames for easier identification
  • Structured Content: Well-structured documents (headings, sections) chunk better

Troubleshooting

IssuePossible CauseSolution
No search results returnedSimilarity threshold too high or no matching documentsLower similarity threshold to 0.5-0.6, broaden query, or remove metadata filters
Search results not relevantQuery too broad or similarity threshold too lowMake query more specific, increase similarity threshold, or add metadata filters
File upload failsFile size exceeds 10MB or unsupported formatCheck file size and format, compress large files, or convert to supported format
Upload processing slowLarge file or complex document structureAllow additional processing time, simplify document structure, or split into smaller files
Collection not visibleCollection not created or permissions issueVerify collection exists in knowledge base settings, check user permissions

Next Steps

  • Collection Management: Learn how to create and manage knowledge base collections in the Knowledge Base section
  • Working with Data: Use dynamic expressions in queries and metadata with the Working with Data guide
  • Workflow Integration: Combine with other integration nodes like HTTP Request and Router
  • Document Processing: Understand chunking strategies and vector stores in Document Processing

Ask AI

FlowGenX Documentation

How can I help you?

Ask me anything about FlowGenX AI - workflows, agents, integrations, and more.

AI responses based on FlowGenX docs