Marketing & SalesDocumentedScanned

feishu-doc-reader

Read and extract content from Feishu (Lark) documents using the official Feishu Open API

Installation

npx clawhub@latest install feishu-doc-reader

View the full skill documentation and source below.

Documentation

Feishu Document Reader

This skill enables reading and extracting content from Feishu (Lark) documents using the official Feishu Open API.

Configuration

Set Up the Skill

Create the configuration file at ./reference/feishu_config.json with your Feishu app credentials:

{
  "app_id": "your_feishu_app_id_here",
  "app_secret": "your_feishu_app_secret_here"
}

Make sure the scripts are executable:

chmod +x scripts/read_doc.sh
chmod +x scripts/get_blocks.sh

Security Note: The configuration file should be kept secure and not committed to version control. Consider using proper file permissions (chmod 600 ./reference/feishu_config.json).

Usage

Basic Document Reading

To read a Feishu document, you need the document token (found in the URL: ). **Using the shell script (recommended):** __CODE_BLOCK_2__ ### Get Detailed Document Blocks (NEW) For complete document structure with all blocks, use the dedicated blocks script: __CODE_BLOCK_3__ **Using Python directly for blocks:** __CODE_BLOCK_4__ ### Supported Document Types - **Docx documents** (new Feishu docs): Full content extraction with blocks, metadata, and structure - **Doc documents** (legacy): Basic metadata and limited content - **Sheets**: Full spreadsheet data extraction with sheet navigation - **Slides**: Basic metadata (content extraction requires additional permissions) ## Features ### Enhanced Content Extraction - **Structured output**: Clean JSON with document metadata, content blocks, and hierarchy - **Complete blocks access**: Full access to all document blocks including text, tables, images, headings, lists, etc. - **Block hierarchy**: Proper parent-child relationships between blocks - **Text extraction**: Automatic text extraction from complex block structures - **Table support**: Proper table parsing with row/column structure - **Image handling**: Image URLs and metadata extraction - **Link resolution**: Internal and external link extraction ### Block Types Supported - **text**: Plain text and rich text content - **heading1/2/3**: Document headings with proper hierarchy - **bullet/ordered**: List items with nesting support - **table**: Complete table structures with cells and formatting - **image**: Image blocks with tokens and metadata - **quote**: Block quotes - **code**: Code blocks with language detection - **equation**: Mathematical equations - **divider**: Horizontal dividers - **page**: Page breaks (in multi-page documents) ### Error Handling & Diagnostics - **Detailed error messages**: Clear explanations for common issues - **Permission validation**: Checks required permissions before making requests - **Token validation**: Validates document tokens before processing - **Retry logic**: Automatic retries for transient network errors - **Rate limiting**: Handles API rate limits gracefully ### Security Features - **Secure credential storage**: Supports both environment variables and secure file storage - **No credential logging**: Credentials never appear in logs or output - **Minimal permissions**: Uses only required API permissions - **Access token caching**: Efficient token reuse to minimize API calls ## Command Line Options ### Main Document Reader __CODE_BLOCK_5__ ### Blocks Reader (NEW) __CODE_BLOCK_6__ ## API Permissions Required Your Feishu app needs the following permissions: -docx:document:readonly- Read document content -doc:document:readonly- Read legacy document content -sheets:spreadsheet:readonly- Read spreadsheet content ## Error Handling Common errors and solutions: - **403 Forbidden**: Check app permissions and document sharing settings - **404 Not Found**: Verify document token is correct and document exists - **Token expired**: Access tokens are valid for 2 hours, refresh as needed - **App ID/Secret invalid**: Double-check your credentials in Feishu Open Platform - **Insufficient permissions**: Ensure your app has the required API permissions - **99991663**: Application doesn't have permission to access the document - **99991664**: Document doesn't exist or has been deleted - **99991668**: Token expired, need to refresh ## Examples ### Extract document with full structure __CODE_BLOCK_7__ ### Get complete document blocks (NEW) __CODE_BLOCK_8__ ### Process spreadsheet data __CODE_BLOCK_9__ ### Extract only text content (Python script) __CODE_BLOCK_10__ ## Security Notes - **Never commit credentials**: Keep app secrets out of version control - **Use minimal permissions**: Only request permissions your use case requires - **Secure file permissions**: Set proper file permissions on secret files (chmod 600) - **Environment isolation**: Use separate apps for development and production - **Audit access**: Regularly review which documents your app can access ## Troubleshooting ### Authentication Issues 1. Verify your App ID and App Secret in Feishu Open Platform 2. Ensure the app has been published with required permissions 3. Check that environment variables or config files are properly set 4. Test with thetest_auth.pyscript to verify credentials ### Document Access Issues 1. Ensure the document is shared with your app or in an accessible space 2. Verify the document token format (should start withdocx_, doc_, or sheet_) 3. Check if the document requires additional sharing permissions ### Network Issues 1. Ensure your server can reachopen.feishu.cn2. Check firewall rules if running in restricted environments 3. The script includes retry logic for transient network failures ### Blocks-Specific Issues 1. **Empty blocks response**: Document might be empty or have no accessible blocks 2. **Missing block types**: Some block types require additional permissions 3. **Incomplete hierarchy**: Use--include-children` flag for complete block tree

References

[Feishu Open API Documentation]()
[Document API Reference]()
[Blocks API Reference]()
[Authentication Guide]()
[Sheet API Reference]()

Back to Skills Directory

Documentation

Feishu Document Reader

This skill enables reading and extracting content from Feishu (Lark) documents using the official Feishu Open API.

Configuration

Set Up the Skill

Create the configuration file at ./reference/feishu_config.json with your Feishu app credentials:

{
  "app_id": "your_feishu_app_id_here",
  "app_secret": "your_feishu_app_secret_here"
}

Make sure the scripts are executable:

chmod +x scripts/read_doc.sh
chmod +x scripts/get_blocks.sh

Security Note: The configuration file should be kept secure and not committed to version control. Consider using proper file permissions (chmod 600 ./reference/feishu_config.json).

Usage

Basic Document Reading

References

[Feishu Open API Documentation]()
[Document API Reference]()
[Blocks API Reference]()
[Authentication Guide]()
[Sheet API Reference]()

feishu-doc-reader

Installation

Documentation

Feishu Document Reader

Configuration

Set Up the Skill

Usage

Basic Document Reading

References

Related Skills in Marketing & Sales

4chan-reader

ab-test-setup

affiliatematic

apollo

attio

feishu-doc-reader

Installation

Documentation

Feishu Document Reader

Configuration

Set Up the Skill

Usage

Basic Document Reading

References

Related Skills in Marketing & Sales

4chan-reader

ab-test-setup

affiliatematic

apollo

attio