SharePoint Document Indexer

Index documents from SharePoint so they become searchable in Open WebUI chat. Point this tool at a SharePoint folder, and it will download, parse, and store the documents for AI-powered search.

How it works
  1. Point to a folder — enter your SharePoint site URL and folder path below.
  2. Download & parse — the system downloads each file and uses Docling to extract structured text (headings, tables, layout).
  3. AI enrichment — a local AI model writes a short context summary for each chunk to improve search accuracy.
  4. Stored for search — enriched chunks are embedded with Voyage-4-large and stored in Qdrant. The "Document Search Assistant" in Open WebUI can then find and cite them.

Currently Indexed Documents

Loading indexed documents...

Index New Documents

Enter a SharePoint folder to index. All supported files in the folder will be processed.

The URL of your SharePoint site (e.g. the address bar when you're on the site home page).

Path within the document library, starting with /. Only files in this folder are indexed (not subfolders).

So we know who indexed what. Shows up in the table above.

Optional tag for filtering results later. Leave blank for general content.

This may take several minutes for large folders.