Mysql

Charset and collation choices are made when defining schemas and databases during backend data modeling. Backend database setup is where utf8mb4 defaults, collation picks, and column migrations belong.

Also useful

Also useful

Where it fits

Example use

Set DEFAULT CHARACTER SET utf8mb4 on a new app database before first migration.

Example use

Verify user-generated content fields use utf8mb4 before go-live to avoid truncation.

Example use

Trace unexpected ORDER BY or unique constraint failures to a case-insensitive collation.

How it compares

Reference skill for charset DDL and collation semantics—not a connection-pooling or query-optimization MCP integration.

Common Questions / FAQ

Who is mysql (character sets) for?

Indie developers and small teams using MySQL or PlanetScale who own schema defaults and need Unicode-safe storage and sorting.

When should I use mysql (character sets)?

Use in Build when creating databases and tables, in Ship when hardening data before launch with international content, and in Operate when investigating sort or duplicate issues tied to collation.

Is mysql (character sets) safe to install?

It is documentation-style SQL guidance with no runtime permissions; review the Security Audits panel on this Prism page before trusting any third-party skill package in your agent.

SKILL.md

READMESKILL.md - Mysql

# Character Sets and Collations

## Always Use utf8mb4
MySQL's `utf8` = `utf8mb3` (3-byte only, no emoji/many CJK). Always `utf8mb4`.

```sql
CREATE DATABASE myapp DEFAULT CHARACTER SET utf8mb4 COLLATE utf8mb4_0900_ai_ci;
```

## Collation Quick Reference
| Collation | Behavior | Use for |
|---|---|---|
| `utf8mb4_0900_ai_ci` | Case-insensitive, accent-insensitive | Default |
| `utf8mb4_0900_as_cs` | Case/accent sensitive | Exact matching |
| `utf8mb4_bin` | Byte-by-byte comparison | Tokens, hashes |

`_0900_` = Unicode 9.0 (preferred over older `_unicode_` variants).

## Collation Behavior

Collations affect string comparisons, sorting (`ORDER BY`), and pattern matching (`LIKE`):

- **Case-insensitive (`_ci`)**: `'A' = 'a'` evaluates to true, `LIKE 'a%'` matches 'Apple'
- **Case-sensitive (`_cs`)**: `'A' = 'a'` evaluates to false, `LIKE 'a%'` matches only lowercase
- **Accent-insensitive (`_ai`)**: `'e' = 'é'` evaluates to true
- **Accent-sensitive (`_as`)**: `'e' = 'é'` evaluates to false
- **Binary (`_bin`)**: strict byte-by-byte comparison (most restrictive)

You can override collation per query:

```sql
SELECT * FROM users
WHERE name COLLATE utf8mb4_0900_as_cs = 'José';
```

## Migrating from utf8/utf8mb3

```sql
-- Find columns still using utf8
SELECT table_name, column_name FROM information_schema.columns
WHERE table_schema = 'mydb' AND character_set_name = 'utf8';
-- Convert
ALTER TABLE users CONVERT TO CHARACTER SET utf8mb4 COLLATE utf8mb4_0900_ai_ci;
```

**Warning**: index key length limits depend on InnoDB row format:
- DYNAMIC/COMPRESSED: 3072 bytes max (≈768 chars with utf8mb4)
- REDUNDANT/COMPACT: 767 bytes max (≈191 chars with utf8mb4)

`VARCHAR(255)` with utf8mb4 = up to 1020 bytes (4×255). That's safe for DYNAMIC/COMPRESSED but exceeds REDUNDANT/COMPACT limits.

## Connection
Ensure client uses `utf8mb4`: `SET NAMES utf8mb4;` (most modern drivers default to this).

`SET NAMES utf8mb4` sets three session variables:
- `character_set_client` (encoding for statements sent to server)
- `character_set_connection` (encoding for statement processing)
- `character_set_results` (encoding for results sent to client)

It also sets `collation_connection` to the default collation for utf8mb4.


---
title: Composite Index Design
description: Multi-column indexes
tags: mysql, indexes, composite, query-optimization, leftmost-prefix
---

# Composite Indexes

## Leftmost Prefix Rule
Index `(a, b, c)` is usable for:
- `WHERE a` (uses column `a`)
- `WHERE a AND b` (uses columns `a`, `b`)
- `WHERE a AND b AND c` (uses all columns)
- `WHERE a AND c` (uses only column `a`; `c` can't filter without `b`)

NOT usable for `WHERE b` alone or `WHERE b AND c` (the search must start from the leftmost column).

## Column Order: Equality First, Then Range/Sort

```sql
-- Query: WHERE tenant_id = ? AND status = ? AND created_at > ?
CREATE INDEX idx_orders_tenant_status_created ON orders (tenant_id, status, created_at);
```

**Critical**: Range predicates (`>`, `<`, `BETWEEN`, `LIKE 'prefix%'`, and sometimes large `IN (...)`) stop index usage for filtering subsequent columns. However, columns after a range predicate can still be useful for:
- Covering index reads (avoid table lookups)
- `ORDER BY`/`GROUP BY` in some cases, when the ordering/grouping matches the usable index prefix

## Sort Order Must Match Index

```sql
-- Index: (status, created_at)
ORDER BY status ASC, created_at ASC   -- ✓ matches (optimal)
ORDER BY status DESC, created_at DESC -- ✓ full reverse OK (reverse scan)
ORDER BY status ASC, created_at DESC  -- ⚠️ mixed directions (may use filesort)

-- MySQL 8.0+: descending index components
CREATE INDEX idx_orders_status_created ON orders (status ASC, created_at DESC);
```

## Composite vs Multiple Single-Column Indexes
MySQL can merge single-column indexes (`index_merge` union/intersection) but

What is this skill?

Always use utf8mb4 instead of legacy utf8 (utf8mb3) for full Unicode including emoji

Collation quick reference for utf8mb4_0900_ai_ci, _as_cs, and _bin with concrete use cases

Explains how collations affect ORDER BY, LIKE, and case/accent sensitivity

Per-query COLLATE override examples for exact matching

information_schema query to find columns still on utf8/utf8mb3 for migration

Documents 3 primary utf8mb4_0900 collation variants in a behavior table

utf8 maps to utf8mb3 (3-byte) in MySQL—not full Unicode

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 4.8k installs on skills.sh; 484 GitHub stars; 2/3 security scanners passed (skills.sh audits).

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Also useful

Also useful

Where it fits

Example use

Set DEFAULT CHARACTER SET utf8mb4 on a new app database before first migration.

Example use

Verify user-generated content fields use utf8mb4 before go-live to avoid truncation.

Example use