Chat Completion

Handles LLM chat completion requests from clients: enforces rate limits on both request count and token count, returns either a streaming or a non-streaming response, and records usage.
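
The flow described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the actual implementation: the names `WindowLimiter`, `UsageRecord`, `handle_chat_completion`, and `fake_model` are hypothetical, the limiter uses a simple fixed window, and tokens are counted by splitting on whitespace instead of using a real tokenizer.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Iterator, List, Union


class RateLimitExceeded(Exception):
    """Raised when a client exceeds its request or token budget."""


@dataclass
class WindowLimiter:
    """Hypothetical fixed-window limiter for request and token counts."""
    max_requests: int
    max_tokens: int
    window_seconds: float = 60.0
    clock: Callable[[], float] = time.monotonic
    _window_start: float = field(default=0.0, init=False)
    _requests: int = field(default=0, init=False)
    _tokens: int = field(default=0, init=False)

    def __post_init__(self) -> None:
        self._window_start = self.clock()

    def check(self, tokens: int) -> None:
        now = self.clock()
        # Start a fresh window once the current one has expired.
        if now - self._window_start >= self.window_seconds:
            self._window_start, self._requests, self._tokens = now, 0, 0
        if self._requests + 1 > self.max_requests:
            raise RateLimitExceeded("request limit exceeded")
        if self._tokens + tokens > self.max_tokens:
            raise RateLimitExceeded("token limit exceeded")
        self._requests += 1
        self._tokens += tokens


@dataclass
class UsageRecord:
    client_id: str
    prompt_tokens: int
    completion_tokens: int


def count_tokens(text: str) -> int:
    # Assumption: whitespace split stands in for a real tokenizer.
    return len(text.split())


def fake_model(prompt: str) -> Iterator[str]:
    """Hypothetical backend that echoes the prompt one word at a time."""
    for word in prompt.split():
        yield word


def handle_chat_completion(
    client_id: str,
    prompt: str,
    stream: bool,
    limiter: WindowLimiter,
    usage_log: List[UsageRecord],
    model: Callable[[str], Iterator[str]] = fake_model,
) -> Union[str, Iterator[str]]:
    prompt_tokens = count_tokens(prompt)
    limiter.check(prompt_tokens)  # reject before any model work is done

    def chunks() -> Iterator[str]:
        completion_tokens = 0
        for chunk in model(prompt):
            completion_tokens += count_tokens(chunk)
            yield chunk
        # Usage is recorded only after the last chunk has been produced.
        usage_log.append(UsageRecord(client_id, prompt_tokens, completion_tokens))

    if stream:
        return chunks()  # caller iterates over chunks as they arrive
    return " ".join(chunks())  # non-streaming: collect the full response


if __name__ == "__main__":
    log: List[UsageRecord] = []
    limiter = WindowLimiter(max_requests=2, max_tokens=100)
    print(handle_chat_completion("client-1", "hello there", False, limiter, log))
    for piece in handle_chat_completion("client-1", "streamed reply", True, limiter, log):
        print(piece)
    print(log)
```

In this sketch the rate-limit check runs before the model is called, so a rejected request costs no model time, and in streaming mode the usage record is written only once the caller has consumed the whole stream. A real service would likely also count completion tokens against the limit and handle clients that disconnect mid-stream.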