*Result*: WaterLLMarks: In-data User Tracing for Distributed LLMaaS Environments

Title:
WaterLLMarks: In-data User Tracing for Distributed LLMaaS Environments
Source:
Proceedings of the 45th IEEE International Conference on Distributed Computing Systems Workshops (ICDCSW) (2025); 45th IEEE International Conference on Distributed Computing Systems (ICDCS), Glasgow, United Kingdom [GB], Jul. 2025
Publisher Information:
IEEE
Publication Year:
2025
Collection:
University of Luxembourg: ORBilu - Open Repository and Bibliography
Document Type:
*Conference* conference object<br />report
Language:
English
Rights:
open access ; http://purl.org/coar/access_right/c_abf2 ; info:eu-repo/semantics/openAccess
Accession Number:
edsbas.E8D7D07A
Database:
BASE

*Further Information*

*peer reviewed ; Due to their size and cost of operation, Large Language Models (LLMs) are often deployed in the cloud on Large Language Model as a Service (LLMaaS) platforms. Following the trends in cloudbased services and multi-tenancy, these platforms are becoming increasingly distributed. Monitoring such environments is challenging, especially without access to the underlying infrastructure. Moreover, applications may rely on multiple services from different providers, further complicating end-to-end observability. In this paper, we propose a novel type of telemetry, termed in-data telemetry, where services are outsourced across multiple LLMaaS providers. We illustrate its application in tracing user interactions with LLMs by embedding user-specific watermarks in the text. These watermarks propagate transparently through multiple services with minimal impact on the semantics of the generated content. We evaluate our approach in a Retrieval-Assisted Generation (RAG) scenario and showcase its potential in a demonstrator.*