In the corporate sector, text data flows swiftly, emanating from diverse sources in various formats, destined for multiple endpoints and actions. This continuous stream of data necessitates management through structuring, making it easy to interpret and analyse, and ultimately directing it to the appropriate user or applying the insights obtained. I utilised several large language models (LLMs) streamline and enhance this process:
- A question-answering LLM, enabling users to inquire about the content of each document within the flow.
- A summarization LLM, offering a concise overview of each document.
- An NLP model for sentiment analysis, determining the sentiment of each document.
- An NLP model that calculates an absolute similarity score among all documents.
- Clustering, which groups similar documents together and simultaneously identifies outlier documents.