Document Management
Document management is a core feature of Knowledge Base, supporting multiple document sources and formats.
π₯ Adding Documentsβ
Supported Document Sourcesβ
| Source | Description |
|---|---|
| File Upload | Upload files from local computer |
| Text Paste | Paste text content directly |
| External Table | Import from DingTalk/Feishu tables |
| Web Scraping | Scrape content from URL |
File Uploadβ
- Click Add Document β Upload File
- Select file(s) from your computer
- Configure chunking settings (optional)
- Click Upload
Supported formats:
.txt- Plain text files.md- Markdown files.pdf- PDF documents.doc,.docx- Word documents
Text Pasteβ
- Click Add Document β Paste Text
- Enter document title
- Paste or type content
- Click Save
Suitable for quickly creating small documents.
External Tableβ
- Click Add Document β External Table
- Enter table URL (DingTalk/Feishu)
- Configure sync settings
- Click Import
Supports importing data from online table services.
Web Scrapingβ
- Click Add Document β Web URL
- Enter the webpage URL
- System scrapes and processes content
- Click Import
Web documents support re-scraping for updates. When webpage content changes, use the refresh feature to get the latest content.
π Document Listβ
List Featuresβ
- Search: Search documents by name
- Sort: Sort by name, size, or date
- Filter: Filter documents by status
Document Statusβ
| Status | Description |
|---|---|
| Enabled | Document is indexed and searchable |
| Disabled | Document exists but excluded from search |
| Processing | Document is being indexed |
| Error | Indexing failed |
βοΈ Management Operationsβ
Basic Operationsβ
| Operation | Description |
|---|---|
| View Details | View document content and metadata |
| Edit | Modify document name and settings |
| Enable/Disable | Toggle document search participation |
| Re-index | Reprocess document with new settings |
| Delete | Remove document permanently |
| View Chunks | Inspect how document was split |
Batch Operationsβ
Support multi-select for batch operations:
- Use checkboxes to select multiple documents
- Click Select All to select all documents
- Click Batch Delete to delete selected documents
π Document Selection in Notebook Modeβ
In Notebook mode, the document list supports selection features:
Selection Featuresβ
- Select Specific Documents: Check documents to include in context
- Select All / Deselect All: Quickly select or deselect all documents
- Auto-selection: Newly uploaded documents are automatically selected
Context Injectionβ
Selected documents are provided as context to the AI during conversations, helping the AI better understand and answer questions.
π Document Editingβ
Editable Contentβ
- Document Name: Modify the display name
- Chunking Settings: Adjust document chunking strategy
- Enable Status: Control whether document participates in retrieval
Editing Limitationsβ
- Source Type: Cannot change document source type
- File Content: File-type document content cannot be directly edited
- Table URL: External table URLs cannot be directly modified
π Web Document Refreshβ
Web documents support re-scraping:
- Find the web document in the document list
- Click the Refresh button
- System will re-scrape the webpage content
- Updated content will be automatically re-indexed
Suitable for tracking frequently updated web content.
π‘ Best Practicesβ
Document Organizationβ
| Practice | Description |
|---|---|
| Meaningful names | Use descriptive document names |
| Consistent format | Standardize document formatting |
| Regular updates | Re-index when documents change |
| Clean content | Remove irrelevant headers/footers |
Document Sizeβ
- Single file recommended not to exceed 50MB
- Large documents can be split into multiple smaller documents
- Text documents are easier to process than scanned PDFs
π Related Documentationβ
- User Guide - Complete knowledge base guide
- Chunking Strategies - Learn how documents are split