guides/related-data-and-reindexing.md

Select File
guides/related-data-and-reindexing.md

# Related data and reindexing

This guide is for the Phoenix or Ecto team that has moved beyond flat documents and now needs to answer the uncomfortable real-app question:

**What should happen when a record changes data that lives in someone else's search document?**

Examples:

- an `Author` name changes and post documents store `author_name`
- a `Tag` is renamed and product documents store tag labels
- an account changes status and many searchable child records should now filter differently
- a support ticket changes teams and search documents store team metadata

The short version is simple:

**Scrypath does not hide cross-record invalidation behind callback magic. Your app owns the fan-out.**

That is the least-surprise Ecto shape, and it keeps recovery honest when the blast radius is larger than one row.

## The rule of thumb

Use simple `Scrypath.sync_record/3` when the changed row is the same row that owns the search document.

Reach for explicit fan-out when:

- the indexed document includes data from associated records
- a shared lookup table changes many documents at once
- a permission or tenant boundary change affects search visibility
- the shape of the document changed and you no longer trust the live index

If you remember one sentence from this guide, make it this:

**Find affected source records explicitly, then sync or rebuild them deliberately.**

## Three kinds of related-data changes

### 1. One changed row, one changed document

This is the easy case.

Example:

- a `Post` title changes
- a `Customer` email changes
- a `Ticket` status changes and the document is built from the ticket row itself

Do this:

- persist the row in your context
- call `Scrypath.sync_record/3` for that row

You do not need a broader recovery workflow unless the sync fails or the contract changed.

### 2. One changed row affects many other documents

This is the common real-app footgun.

Example:

- an `Author` rename should update every searchable post that stores `author_name`
- a `Category` label change should update every product document that stores category names
- an organization rename should update many user-facing result documents

Do this:

- persist the changed row in its owning context
- query for the affected source records explicitly
- sync those source records in a deliberate follow-up step

In Ecto terms, this usually means:

- identify the source schema that owns the search document
- build one query that finds affected records
- decide whether the follow-up should be inline, queued, or manual

The important thing is what you do **not** promise:

- no hidden association walking
- no automatic preload magic
- no assumption that a related-row change is cheap

### Fan-out with `Scrypath.sync_related/3`

When one changed row fans out to many search documents, Scrypath ships an explicit
entrypoint for it: `Scrypath.sync_related/3`. The context invokes it deliberately; it
is not a hidden callback. You declare the fan-out as metadata on the **owning** schema,
write one resolver that finds the affected target records, and the context calls
`sync_related/3` choosing inline or durable (Oban) execution.

There is no public worker macro to write. The `:oban` path is dispatched by an internal
Scrypath worker; adopters never name or `use` a worker module. You select
durable execution with `sync_mode: :oban` and point Scrypath at your Oban instance.

#### (a) Declare the fan-out on the owning schema

The fan-out lives on the schema whose change drives the update — here `Author`, because an
author rename should re-sync every post that stores `author_name`. `fan_outs:` names a key
(`:posts`), a `target:` schema (the schema that owns the search document), and a `resolver:`
MFA tuple that maps the changed owner(s) to the target records to re-sync:

```elixir
defmodule MyApp.Accounts.Author do
  use Ecto.Schema

  schema "authors" do
    field(:name, :string)
    has_many(:posts, MyApp.Blog.Post)
    timestamps()
  end

  # Owning-side fan_outs: declaration consumed by Scrypath.sync_related/3.
  # The resolver MFA must handle BOTH arities (inline -> Author records, oban -> author IDs).
  def __scrypath__(:fan_outs) do
    [
      posts: [
        target: MyApp.Blog.Post,
        resolver: {MyApp.Accounts, :resolve_posts_for_authors, []}
      ]
    ]
  end

  # Used by the :oban fan-out enqueue path (Scrypath.Identity.document_ids/2).
  def __scrypath__(:document_id), do: :id

  def changeset(author, attrs) do
    author
    |> Ecto.Changeset.cast(attrs, [:name])
    |> Ecto.Changeset.validate_required([:name])
  end
end
```

> **Note:** `use Scrypath` does not generate a `__scrypath__(:fan_outs)` accessor.
> The macro generates reflection keys for fields, settings, and document identity
> (see `lib/scrypath/schema.ex`) but not for fan-out declarations. Until macro-expansion
> support for `fan_outs:` is added, declare `__scrypath__(:fan_outs)` and
> `__scrypath__(:document_id)` by hand, as shown above. This is the same pattern
> used by the library's own hermetic fan-out tests.

The resolver MFA is invoked as `apply(mod, fun, [first_arg | extra_args])`. The `[]` above
means no extra args; `first_arg` is supplied by Scrypath and differs by mode (see (c)).

#### (b) Call `sync_related/3` from the context — inline or oban

The owning context persists the change, keeps any denormalized projection in sync, and then
invokes the fan-out explicitly. `opts[:fan_out]` is **required** — it names a key in the
schema's `fan_outs:` declaration, and `sync_related/3` raises `ArgumentError` if it is absent.

Inline execution resolves and syncs the affected records **now**, in the calling process:

```elixir
# inline (context-side) — resolves + syncs in this process, returns when done
{:ok, %{mode: :inline, status: :completed}} =
  Scrypath.sync_related(Author, updated_author,
    fan_out: :posts, sync_mode: :inline, backend: Scrypath.Meilisearch)
```

Oban execution enqueues the internal `RelatedWorker` and returns immediately — the work is
**durably queued**, not done:

```elixir
# oban (context-side) — "durably queued", NOT "searchable now"
{:ok, %{mode: :oban, status: :accepted}} =
  Scrypath.sync_related(Author, updated_author,
    fan_out: :posts, sync_mode: :oban, oban: MyApp.Oban,
    oban_queue: :scrypath, backend: Scrypath.Meilisearch)
```

Note the status difference: `:inline` returns `status: :completed` (the documents are
synced); `:oban` returns `status: :accepted` (the job is queued and will run later).

#### (c) Write an arity-safe resolver (the #1 footgun)

The resolver receives a **different first argument depending on the mode**, and this is the
single most common mistake when adopting `sync_related/3`:

- The **inline** path hands the resolver a list of owning-schema **records** (`[%Author{}]`).
- The **oban** path hands the resolver a list of owning-schema **document IDs** (`[author_id]`),
  because the Author struct is discarded at enqueue time — only the IDs round-trip through the
  Oban job. The resolver can never trust `.name` to be loaded on the oban path.

So the resolver must pattern-match **both shapes** and funnel both to a single reload-by-id
query against the target schema. Reloading by id makes the two arities converge:

```elixir
defmodule MyApp.Accounts do
  import Ecto.Query
  alias MyApp.Accounts.Author
  alias MyApp.Blog.Post
  alias MyApp.Repo

  def update_author(%Author{} = author, attrs, sync_opts) do
    {:ok, updated} =
      author |> Author.changeset(attrs) |> Repo.update()

    # Keep the denormalized projection in sync — app-owned, explicit, before fan-out.
    {_count, _} =
      from(p in Post, where: p.author_id == ^updated.id)
      |> Repo.update_all(set: [author_name: updated.name])

    # Explicit fan-out the context invokes — not a callback.
    {:ok, result} =
      Scrypath.sync_related(Author, updated, Keyword.put(sync_opts, :fan_out, :posts))

    {:ok, result, updated}
  end

  # Arity-safe resolver: inline passes Author RECORDS, oban passes Author DOCUMENT IDs.
  # Both funnel to a reload-by-author_id query so the reload is uniform.
  def resolve_posts_for_authors([%Author{} | _] = authors),
    do: authors |> Enum.map(& &1.id) |> reload_posts()

  def resolve_posts_for_authors([_id | _] = author_ids), do: reload_posts(author_ids)
  def resolve_posts_for_authors([]), do: []

  defp reload_posts(author_ids),
    do: Repo.all(from(p in Post, where: p.author_id in ^author_ids))
end
```

A resolver that handles only records (or only ids) will crash the `:oban` path on a value it
did not expect. Always handle both arities and reload from the database by id.

## 3. The document contract changed

Sometimes the question is not "which records changed?"

Sometimes the question is:

**Do I still trust the live index at all?**

Examples:

- you added or removed projected fields
- you changed filterable, sortable, or faceting declarations
- you changed settings in a way that implies a rebuild
- you cannot confidently enumerate every affected record

That is no longer a simple fan-out job.

This is when you move from sync into repair or rebuild:

- use index-contract drift or reconcile when you need posture first
- use backfill when the live index is still trustworthy and just needs repair
- use managed reindex when the contract changed or the live index no longer deserves trust

## A Phoenix context shape that stays honest

The clean shape is still:

- the owning context persists the original change
- the owning context decides whether there is fan-out work
- the context chooses the follow-up sync mode or operator path

For example, if posts own the search document but authors affect post documents:

1. `Accounts` updates the author
2. `Accounts` or a small orchestration boundary identifies affected posts
3. that boundary syncs those posts explicitly

The exact module split can vary. The important invariant is that web modules still do not own this logic.

## Picking the right follow-up path

The choice between inline and durable (Oban) execution comes down to two
questions: how large is the **blast radius** (how many rows the change affects),
and how much **request latency** can the calling request tolerate.

### Small blast radius, latency-tolerant caller

Good fit:

- a small, bounded number of affected rows
- the caller benefits from knowing the update finished now
- the request can absorb the extra latency of resolving and syncing in-process

Default:

- inline follow-up with `sync_mode: :inline`

This returns `{:ok, %{mode: :inline, status: :completed}}` only after the affected
documents are synced, so when the function returns you can truthfully say the work
is done.

### Larger blast radius, latency-sensitive request

Good fit:

- many affected rows
- request latency matters and the caller should not block on the fan-out
- your app already treats Oban as normal production infrastructure

Default:

- durable enqueue with `sync_mode: :oban`

This enqueues the internal `RelatedWorker` and returns
`{:ok, %{mode: :oban, status: :accepted}}` immediately — before any document is
synced. That changes what you are allowed to claim:

Truth you can say:

- "the follow-up work is durably queued"

Truth you cannot say:

- "all affected documents are searchable now"

#### What failures look like under `:oban`

When you choose `sync_mode: :oban`, the internal `RelatedWorker` — not your app code —
maps each sync outcome onto Oban's retry/cancel contract. Knowing this mapping tells you
which failures are permanent and which Oban will retry for you:

| Outcome of the underlying sync | RelatedWorker result | What happens |
| --- | --- | --- |
| Success (`:ok` / `{:ok, _}`) | `:ok` | Job completes. |
| Backend HTTP 4xx (e.g. bad request) | `{:cancel, ...}` | **Permanent** — Oban cancels the job, no retry. A 4xx is a client-side problem retrying will not fix. |
| Backend HTTP 5xx or generic error | `{:error, reason}` | **Transient** — Oban retries with backoff up to `max_attempts: 8` (the worker default). |
| Invalid schema / unknown fan-out | `{:cancel, {:invalid_job, reason}}` | **Permanent** — the job can never succeed, so it is cancelled rather than retried. |

You do not write or configure this mapping; the library owns it. Your resolver and context
stay focused on finding and syncing the right records.

### Unbounded or uncertain blast radius

Good fit:

- imports
- large migrations
- staged operator work
- situations where you cannot confidently bound the affected rows up front
- situations where you want to inspect before cutover

Default:

- manual flow, backfill, or managed reindex

When you cannot answer "how many rows will this touch?", neither `sync_mode: :inline`
nor `sync_mode: :oban` is the right reach — move to a deliberate operator path instead.

## Deletes are their own category

Deletes are where many libraries get hand-wavy. Scrypath should not.

If a source row is gone, a later async delete job may not be able to reload it. That is normal. The safe pattern is:

- carry enough identity to delete the document directly from the index
- do not depend on reloading the deleted source row later

When stale deletes still appear:

- inspect failed work
- inspect reconcile and drift posture
- decide whether one explicit retry is enough or whether the index needs broader repair

## Tenant and permission changes are related-data changes too

In SaaS apps, not every related-data change is cosmetic.

Sometimes the changed data affects **who should be able to see the document at all**.

Examples:

- a record moves tenants
- a team membership change affects searchable visibility
- a role or account-status change changes which rows should appear

Treat these as higher-risk related-data events:

- be explicit about the tenant boundary in app code
- be explicit about which source records are affected
- prefer a repair or rebuild workflow when the visibility model changed broadly

Scrypath can help with sync, search, and recovery. It should not pretend index prefixes alone are your authorization model.

## When to choose backfill versus reindex

Use this shortcut:

- **backfill** when the live index contract is still right and the data just needs repair
- **reindex** when the contract changed or you no longer trust the live index

Examples where backfill is usually right:

- a queue outage delayed some updates
- a bounded set of related-data fan-out jobs failed
- the document shape is unchanged and you trust the current index settings

Examples where reindex is usually right:

- projected fields changed broadly
- filterable or faceting declarations changed
- settings changed in a way that implies a rebuild
- you cannot confidently identify all affected source records

## A practical decision tree

Ask these questions in order:

1. Did the changed row own the document directly?
2. If not, can I clearly query the affected source records?
3. Is the blast radius small enough for normal sync, or is this now repair work?
4. Do I still trust the live index contract?
5. What truth do I need to be able to say when this function returns?

That yields the usual outcomes:

- direct sync
- explicit fan-out sync
- queued fan-out sync
- backfill
- managed reindex

## What Scrypath should stay opinionated about here

These are the least-surprise defaults:

- contexts own orchestration
- the library owns execution
- related-data propagation is explicit
- request-edge helpers do not own fan-out logic
- `use Scrypath` stays metadata-only
- recovery is public and documented

That split is the heart of `sync_related/3`: **contexts own orchestration** — your code
decides *when* to fan out, *which* records are affected, and *which* mode to run — while the
**library owns execution** — Scrypath runs the resolver, dispatches the internal
`RelatedWorker` under `:oban`, and maps failures onto the retry/cancel contract. Neither side
reaches into the other.

The tempting alternative is callback magic. That shape feels easy until the first production drift incident. Then it becomes hard to reason about, hard to test, and hard to repair.

## Read next

- [JTBD and user flows](jtbd-and-user-flows.md)
- [Sync modes and visibility](sync-modes-and-visibility.md)
- [Drift recovery](drift-recovery.md)
- [Operator Mix tasks](operator-mix-tasks.md)