LLM Optimization

By julian_harty

9 months ago

87 Views

0 Uses

Prompt

Create a list of the 20 most influential papers on LLM workload benchmarking, in order of popularity include author, organization or company, year created, url link to the document, and count of citations
Also Create a list of top 15 use cases by volume in order for LLM’s for example Chatbot/virtual agent, code generation, search, legal and medical research e.t.c based only on the above 20 papers, cite the source papers in the results table, include the following columns:
Typical context Length, typical number of multistep interactions, recommended batch size, average kv usage per session, as well as stats such as recommended TTFT range, ITL range, end to end latency, Session TTL
Create three different tables of the above for following use case: 5 active sessions, 20 inactive sessions, 5000 active sessions, 100000 inactive sessions, 50000 active sessions and 1000000 inactive sessions,

LLM Optimization

Prompt

Model Settings

Temperature

Max Tokens

About the Author

Send Feedback

Submit Content

LLM Optimization

Prompt

Model Settings

Temperature

Max Tokens

About the Author

Related Ai Prompts

Share this prompt

Send Feedback