# Release anyparser-langchain v0.0.2
- Update version in pyproject.toml
- Update version in __init__.py
- Update version in test_version.py
- Minor cleanup in README.md
- Added changelog
heyaibi committed Feb 17, 2025
1 parent 716ca0b commit c22fc09
Showing 6 changed files with 106 additions and 16 deletions.
14 changes: 1 addition & 13 deletions README.md
@@ -10,7 +10,6 @@ https://anyparser.com
pip install anyparser-langchain
```


## Anyparser LangChain Examples

This `examples` directory contains examples demonstrating different ways to use the Anyparser LangChain integration.
@@ -26,7 +25,6 @@ python examples/07_ocr_json.py
python examples/08_crawler.py
```


## Setup

Before running the examples, make sure to set your Anyparser API credentials as environment variables:
@@ -92,16 +90,6 @@ export ANYPARSER_API_URL="https://anyparserapi.com"
- Error handling
- Async/await usage











## License

-Apache-2.0
+Apache-2.0
2 changes: 1 addition & 1 deletion anyparser_langchain/__init__.py
@@ -2,7 +2,7 @@
Anyparser LangChain Integration
"""

-__version__ = "0.0.1"
+__version__ = "0.0.2"

import asyncio
from typing import List, Literal, Optional, Union
7 changes: 7 additions & 0 deletions changelogs/v0.0.1-changelog.md
@@ -0,0 +1,7 @@
# Anyparser LangChain: Seamless Integration of Anyparser with LangChain

Todo

## License

Apache-2.0
95 changes: 95 additions & 0 deletions changelogs/v0.0.2-changelog.md
@@ -0,0 +1,95 @@
# Anyparser LangChain: Seamless Integration of Anyparser with LangChain

https://anyparser.com

**Integrate Anyparser's powerful content extraction capabilities with LangChain for enhanced AI workflows.** This integration package enables seamless use of Anyparser's document processing and data extraction features within your LangChain applications, making it easier than ever to build sophisticated AI pipelines.

## Installation

```bash
pip install anyparser-langchain
```

## Anyparser LangChain Examples

This `examples` directory contains examples demonstrating different ways to use the Anyparser LangChain integration.

```bash
python examples/01_single_file_json.py
python examples/02_single_file_markdown.py
python examples/03_multiple_files_json.py
python examples/04_multiple_files_markdown.py
python examples/05_load_folder.py
python examples/06_ocr_markdown.py
python examples/07_ocr_json.py
python examples/08_crawler.py
```

## Setup

Before running the examples, make sure to set your Anyparser API credentials as environment variables:

```bash
export ANYPARSER_API_KEY="your-api-key"
export ANYPARSER_API_URL="https://anyparserapi.com"
```
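A quick pre-flight check can catch missing credentials before any example runs. The sketch below is not part of the package: `missing_credentials` and `require_credentials` are hypothetical helper names, and only the two variable names are taken from the setup step above.

```python
import os

# Variable names from the setup step; everything else here is illustrative.
REQUIRED_VARS = ("ANYPARSER_API_KEY", "ANYPARSER_API_URL")


def missing_credentials(env=None):
    """Return the required variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


def require_credentials(env=None):
    """Raise a RuntimeError naming every missing variable."""
    missing = missing_credentials(env)
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
```

Calling `require_credentials()` at the top of a script fails early with a message that names the unset variables, rather than failing later inside a request.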

## Examples

### 1. Single File Processing
- `01_single_file_json.py`: Process a single file with JSON output
- `02_single_file_markdown.py`: Process a single file with markdown output

### 2. Multiple File Processing
- `03_multiple_files_json.py`: Process multiple files with JSON output
- `04_multiple_files_markdown.py`: Process multiple files with markdown output
- `05_load_folder.py`: Load and process all files from a folder (max 5 files)

### 3. OCR Processing
- `06_ocr_markdown.py`: Process images/scans with OCR (markdown output)
- `07_ocr_json.py`: Process images/scans with OCR (JSON output)

### 4. Web Crawling
- `08_crawler.py`: Basic web crawling with essential settings

## Features Demonstrated

### Document Processing
- Different output formats (markdown, JSON)
- Multiple file handling
- Folder processing
- Metadata handling

### OCR Capabilities
- Language support (ISO 639-2 codes)
- OCR presets (fast, balanced, scan)
- Image and table extraction

### Web Crawling
- Basic crawling with depth and scope control
- Advanced URL and content filtering
- Crawling strategies (BFS, LIFO)
- Rate limiting and robots.txt respect

## Notes

- All examples use async/await for better performance
- Error handling is included in all examples
- Each example includes detailed comments explaining the options used
- OCR examples support multiple languages
- Crawler examples demonstrate various filtering and control options
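The async and error-handling notes above can be illustrated with a generic pattern. This is a sketch only: `parse_file` is a hypothetical stand-in that simulates I/O and does not call Anyparser, and `parse_many` is likewise an illustrative name rather than part of the package.

```python
import asyncio


async def parse_file(path):
    """Hypothetical stand-in for a real parsing call; it only simulates I/O."""
    await asyncio.sleep(0.01)
    if not path:
        raise ValueError("empty path")
    return {"source": path, "ok": True}


async def parse_many(paths):
    """Process paths concurrently; a failure in one does not abort the rest."""
    results = await asyncio.gather(
        *(parse_file(p) for p in paths),
        return_exceptions=True,  # exceptions are returned as values, not raised
    )
    # Keep successful results only, in the same order as the input paths.
    return [r for r in results if not isinstance(r, Exception)]
```

Running `asyncio.run(parse_many(["a.pdf", "b.pdf"]))` awaits both calls concurrently, so the total wait is close to that of the slowest call rather than the sum.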

## Features Demonstrated

- Different output formats (markdown, JSON)
- OCR capabilities with language support
- OCR performance presets
- Image extraction
- Table extraction
- Metadata handling
- Error handling
- Async/await usage

## License

Apache-2.0
2 changes: 1 addition & 1 deletion pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "poetry.core.masonry.api"

[project]
name = "anyparser-langchain"
-version = "0.0.1"
+version = "0.0.2"
description = "Anyparser LangChain Integration"
readme = "README.md"
requires-python = ">=3.9"
2 changes: 1 addition & 1 deletion tests/test_version.py
@@ -8,4 +8,4 @@
def test_version():
    """Test version is a string."""
    assert isinstance(__version__, str)
-    assert __version__ == "0.0.1"
+    assert __version__ == "0.0.2"
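This commit bumps the same version string in three places (`pyproject.toml`, `__init__.py`, and `test_version.py`). One way to reduce that duplication, sketched here as an assumption rather than anything the repository does, is to have the test read the version from `pyproject.toml`. The helper name `read_project_version` is hypothetical, and a regular expression is used instead of `tomllib` because the project declares `requires-python = ">=3.9"` and `tomllib` is only in the standard library from Python 3.11.

```python
import re


def read_project_version(pyproject_text):
    """Return the version string from the [project] table of a pyproject.toml."""
    in_project = False
    for line in pyproject_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("["):
            # Track whether the current table is [project].
            in_project = stripped == "[project]"
            continue
        if in_project:
            match = re.match(r'version\s*=\s*"([^"]+)"', stripped)
            if match:
                return match.group(1)
    raise ValueError("no version found in [project] table")
```

A test could then compare `read_project_version(Path("pyproject.toml").read_text())` with `__version__`, so a release would only need the hard-coded literal updated in two files instead of three.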
