RAGPostProcessor
1.0.1
PowerShell wrapper for the RAG Post Processor Apify actor. Clean and chunk scraped text for RAG and LLM pipelines in one command. Strips HTML, collapses whitespace, splits into overlapping chunks ready for OpenAI embeddings, Pinecone, Weaviate, LangChain, and LlamaIndex. Supports direct text input, file input, and Apify dataset chaining.
Minimum PowerShell version
5.1
Installation Options
Owners
Copyright
(c) 2026 jalicia. All rights reserved.
Package Details
Author(s)
- jalicia
Tags
RAG LLM AI Apify TextProcessing Chunking Embeddings LangChain LlamaIndex OpenAI Pinecone Weaviate VectorDatabase TextCleaning NLP
Functions
Dependencies
This module has no dependencies.
Release Notes
v1.0.1: Added dataset chaining support, improved field detection for 28 scraper output formats, fixed short-input handling, added file input support.
FileList
- RAGPostProcessor.nuspec
- Public\Invoke-RAGPostProcessor.ps1
- Tests\Test-RAGPostProcessor.ps1
- RAGPostProcessor.psd1
- RAGPostProcessor.psm1
- README.md
Version History
| Version | Downloads | Last updated |
|---|---|---|
| 1.0.1 (current version) | 4 | 6/20/2026 |
| 1.0.0 | 4 | 6/20/2026 |