RAGPostProcessor

1.0.1

PowerShell wrapper for the RAG Post Processor Apify actor. Cleans and chunks scraped text for RAG and LLM pipelines in one command. It strips HTML, collapses whitespace, and splits the text into overlapping chunks ready for OpenAI embeddings, Pinecone, Weaviate, LangChain, and LlamaIndex. Supports direct text input, file input, and Apify dataset chaining.
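The cleaning and chunking steps described above can be sketched locally in a few lines of PowerShell. This is only an illustration of the general technique, not the module's or the actor's actual implementation: the function names ConvertTo-CleanText and Split-TextChunk, their parameters, and the default sizes are all hypothetical, and the tag stripping is a crude regular expression rather than a real HTML parser.

```powershell
# Illustrative sketch only. These functions are hypothetical and are NOT part of
# the RAGPostProcessor module; they just demonstrate the idea of clean-then-chunk.

function ConvertTo-CleanText {
    param([Parameter(Mandatory)][string]$Text)

    # Replace anything that looks like an HTML tag with a space (crude, regex-based).
    $noTags = [regex]::Replace($Text, '<[^>]+>', ' ')
    # Collapse every run of whitespace into one space and trim both ends.
    return [regex]::Replace($noTags, '\s+', ' ').Trim()
}

function Split-TextChunk {
    param(
        [Parameter(Mandatory)][AllowEmptyString()][string]$Text,
        [int]$ChunkSize = 1000,   # assumed default, in characters
        [int]$Overlap   = 200     # assumed default, in characters
    )

    if ($ChunkSize -le 0 -or $Overlap -lt 0 -or $Overlap -ge $ChunkSize) {
        throw 'ChunkSize must be positive and Overlap must be at least 0 and less than ChunkSize.'
    }

    # Each chunk starts (ChunkSize - Overlap) characters after the previous one,
    # so consecutive chunks share Overlap characters.
    $step   = $ChunkSize - $Overlap
    $chunks = [System.Collections.Generic.List[string]]::new()

    for ($start = 0; $start -lt $Text.Length; $start += $step) {
        $length = [Math]::Min($ChunkSize, $Text.Length - $start)
        $chunks.Add($Text.Substring($start, $length))
        # Stop once a chunk reaches the end of the text, so no redundant tail
        # chunk (fully contained in the previous one) is emitted.
        if ($start + $length -ge $Text.Length) { break }
    }

    # The leading comma keeps a single-chunk result as an array.
    return ,$chunks.ToArray()
}

# Usage: clean a small HTML fragment, then split it into overlapping chunks.
$clean  = ConvertTo-CleanText -Text "<p>Hello   <b>world</b></p>`n`n<p>Second   paragraph.</p>"
$chunks = Split-TextChunk -Text $clean -ChunkSize 12 -Overlap 4
$chunks | ForEach-Object { "[$_]" }
```

Character-based chunking is the simplest variant to show. Token-based or sentence-aware splitting is often preferred for embedding models, and the page does not say which approach the actor uses.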

Minimum PowerShell version

5.1

Installation Options

Copy and paste the following command to install this package using PowerShellGet:

Install-Module -Name RAGPostProcessor

Copy and paste the following command to install this package using Microsoft.PowerShell.PSResourceGet:

Install-PSResource -Name RAGPostProcessor

You can deploy this package directly to Azure Automation. Note that deploying packages with dependencies will deploy all the dependencies to Azure Automation.

Manually download the .nupkg file to your system's default download location. Note that the file won't be unpacked and won't include any dependencies.

Copyright

(c) 2026 jalicia. All rights reserved.

Package Details

Author(s)

  • jalicia

Tags

RAG LLM AI Apify TextProcessing Chunking Embeddings LangChain LlamaIndex OpenAI Pinecone Weaviate VectorDatabase TextCleaning NLP

Functions

Invoke-RAGPostProcessor
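This page does not list the parameters of Invoke-RAGPostProcessor, so the safest way to learn them is to ask PowerShell after installing the module. A minimal sketch using only built-in cmdlets, and assuming the module has been installed under the name shown above:

```powershell
# Check whether the module is installed before trying to load it.
$module = Get-Module -ListAvailable -Name RAGPostProcessor

if ($module) {
    Import-Module RAGPostProcessor

    # List the commands the module exports.
    Get-Command -Module RAGPostProcessor

    # Show the parameter sets, then the full help text, for the exported function.
    Get-Command Invoke-RAGPostProcessor -Syntax
    Get-Help Invoke-RAGPostProcessor -Full
}
else {
    Write-Warning 'RAGPostProcessor is not installed; run Install-Module -Name RAGPostProcessor first.'
}
```

How much Get-Help shows depends on whether the author shipped comment-based help. Get-Command -Syntax reports the parameter names and types either way.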

Dependencies

This module has no dependencies.

Release Notes

v1.0.1: Added dataset chaining support, improved field detection for 28 scraper output formats, fixed short-input handling, added file input support.

Version History

Version                  Downloads  Last updated
1.0.1 (current version)  4          6/20/2026
1.0.0                    4          6/20/2026