Log in to leave a comment
No posts yet
cache_control: {"type": "ephemeral"} marker immediately after the unchanging section.\n * Place Dynamic Data at the Bottom: Put variables that change every time\u2014such as question content, user IDs, and current time\u2014below the cache marker, at the very bottom of the prompt.\n\nA cost that was $0.06 for 20k tokens becomes $0.006\u2014one-tenth\u2014when it hits the cache. For services handling large documents, this is the point where the profit structure itself changes.\n\n-----\n\n## Chain of Draft (CoD) to Prevent Verbose Reasoning\n\nWhen asked to “Think step by step (CoT)” to solve complex problems, the model pours out long processes like a diary entry. That is all cost. The alternative is CoD (Chain of Draft). Instruct the model to reason briefly, like taking notes, in 5 words or less using core keywords. In arithmetic reasoning tests, while CoT used 172.5 tokens, CoD reached the same correct answer with only 31.3 tokens.\n\n### How to Apply CoD\n\n * Activate Draft Mode: Specify in the system prompt to “Record thoughts for each step in a draft format of 5 words or less.”\n * Specify Sources: If you are worried about hallucinations, add a minimal verification condition such as “Specify the supporting passage with a tag.”\n * Exception Clauses: Prevent quality degradation by leaving an escape hatch that allows detailed descriptions only for truly complex cases.\n\nYou can reduce output tokens by up to 92% while maintaining accuracy. Response latency is also cut by more than half.\n\n-----\n\n## Real-time Cost Monitoring and Profit Analysis\n\nAll this optimization is only meaningful if it is visible. If a shopping mall service making 300,000 calls per month combines prompt caching and CoD, the cost collapses from $4,500 to $660. In effect, about 5 million KRW in monthly operating profit is generated by a few lines of prompt modification..claudeignore file to prevent unnecessary files from being included in the context.$Cost = N \times (T_{in} \times P_{in} + T_{out} \times P_{out})$ in a spreadsheet and check performance weekly.\n\nDelete “Thank you” from your system prompts and change the data order today. That small inconvenience turns a monthly invoice into profit.