Execute Groq production deployment checklist and go-live procedures. Use when deploying Groq integrations to production, preparing for launch, or implementing go-live procedures. Trigger with phrases like "groq production", "deploy groq", "groq go-live", "groq launch checklist".
Use the skills CLI to install this skill with one command. Auto-detects all installed AI assistants.
Method 1 - skills CLI
npx skills i jeremylongshore/claude-code-plugins-plus-skills/plugins/saas-packs/groq-pack/skills/groq-prod-checklistMethod 2 - openskills (supports sync & update)
npx openskills install jeremylongshore/claude-code-plugins-plus-skillsAuto-detects Claude Code, Cursor, Codex CLI, Gemini CLI, and more. One install, works everywhere.
Installation Path
Download and extract to one of the following locations:
No setup needed. Let our cloud agents run this skill for you.
Select Provider
Select Model
Best for coding tasks
No setup required
Complete pre-launch checklist for deploying Groq-powered applications to production. Covers API key security, model selection, rate limit planning, fallback strategies, and monitoring setup.
.env files)gsk_ pattern in codellama-3.3-70b-versatile)llama-3.1-8b-instant)max_tokens set to actual expected output size (not context max)retry-after header implementedp-queue or similar)Groq.APIError, Groq.APIConnectionError)async function completionWithFallback(messages: any[]) {
try {
return await groq.chat.completions.create({
model: "llama-3.3-70b-versatile",
messages,
timeout: 15_000,
});
} catch (err
// /api/health or /healthz
export async function GET() {
const checks: Record<string, any> = { status: "healthy" };
const start = performance.now();
try {
await groq.chat.completions.
groq-incident-runbook)set -euo pipefail
# Pre-flight checks
echo "1. Groq API status..."
curl -sf https://status.groq.com > /dev/null && echo "OK" || echo "ISSUE"
echo "2. Production key valid..."
curl -sf https://api.groq.com/openai/v1/models \
-H "Authorization: Bearer $GROQ_API_KEY_PROD
| Alert | Condition | Severity |
|---|---|---|
| API errors spike | 5xx rate > 5/min | P1 |
| Latency degraded | p95 > 1000ms | P2 |
| Rate limited | 429 count > 5/min | P2 |
| Auth failure | Any 401 error | P1 |
| Spending near cap | >90% of monthly budget | P3 |
For version upgrades, see groq-upgrade-migration.