LLM Optimization

9 months ago
87 Views
0 Uses

Prompt

ChatGPT
Claude
Grok
OpenRouter
Create a list of the 20 most influential papers on LLM workload benchmarking, in order of popularity include author, organization or company, year created, url link to the document, and count of citations
Also Create a list of top 15 use cases by volume in order for LLM’s for example Chatbot/virtual agent, code generation, search, legal and medical research e.t.c based only on the above 20 papers, cite the source papers in the results table, include the following columns:
Typical context Length, typical number of multistep interactions, recommended batch size, average kv usage per session, as well as stats such as recommended TTFT range, ITL range, end to end latency, Session TTL
Create three different tables of the above for following use case: 5 active sessions, 20 inactive sessions, 5000 active sessions, 100000 inactive sessions, 50000 active sessions and 1000000 inactive sessions,
ChatGPT
Claude
Grok
OpenRouter

Model Settings

Temperature

0.7

Max Tokens

2000