Advanced topics on federated entities

This article describes complex behaviors of federated entities beyond those covered in entity basics.

Using advanced `@key`s

Depending on your entities' fields and usage, you may need to use more advanced @keys. For example, you may need to define a compound @key if multiple fields are required to uniquely identify an entity. If different subgraphs interact with different fields an entity, you may need to define multiple—and sometimes differing—@keys for the entity.

Compound `@key`s

A single @key can consist of multiple fields, the combination of which uniquely identifies an entity. This is called a compound or composite key. In the following example, the combination of both username and domain fields is required to uniquely identify the User entity:

Users subgraph

1
type User @key(fields: "username domain") {
2
  username: String!
3
  domain: String!
4
}

Nested fields in compound `@key`s

Compound keys can also include nested fields. In the following example, the User entity's @key consists of both a user's id and the id of that user's associated Organization:

Users subgraph

1
type User @key(fields: "id organization { id }") {
2
  id: ID!
3
  organization: Organization!
4
}
5

6
type Organization {
7
  id: ID!
8
}

Multiple `@key`s

When different subgraphs interact with different fields of an entity, you may need to define multiple @keys for the entity. For example, a Reviews subgraph might refer to products by their ID, whereas an Inventory subgraph might use SKUs.

In the following example, the Product entity can be uniquely identified by either its id or its sku:

Products subgraph

1
type Product @key(fields: "id") @key(fields: "sku") {
2
  id: ID!
3
  sku: String!
4
  name: String!
5
  price: Int
6
}

Note: If you include multiple sets of @key fields, the query planner uses the most efficient set for entity resolution. For example, suppose you allow a type to be identified by @key(fields: "id") or @key(fields: "id sku"):

1
type Product @key(fields: "id") @key(fields: "id sku") {
2
  # ...
3
}

That means either id or (id and sku) is enough to uniquely identify the entity. Since id alone is enough, the query planner will use only that field to resolve the entity, and @key(fields: "id sku") is effectively ignored.

Referencing entities with multiple keys

A subgraph that references an entity without contributing any fields can use any @key fields in its stub definition. For example, if the Products subgraph defines the Product entity like this:

Products subgraph

1
type Product @key(fields: "id") @key(fields: "sku") {
2
  id: ID!
3
  sku: String!
4
  name: String!
5
  price: Int
6
}

Then, a Reviews subgraph can use either id or sku in the stub definition:

Reviews subgraph

1
# Either:
2
type Product @key(fields: "id", resolvable: false) {
3
  id: ID!
4
}
5

6
# Or:
7
type Product @key(fields: "sku", resolvable: false) {
8
  sku: String!
9
}

When resolving a reference for an entity with multiple keys, you can determine how to resolve it based on which key is present. For example, if you're using @apollo/subgraph, it could look like this:

resolvers.js

1
// Products subgraph
2
const resolvers = {
3
  Product: {
4
    __resolveReference(productRepresentation) {
5
      if(productRepresentation.sku){
6
        return fetchProductBySku(productRepresentation.sku);
7
      } else {
8
        return fetchProductByID(productRepresentation.id);
9
      }
10
    }
11
  },
12
  // ...other resolvers...
13
}

Differing `@key`s across subgraphs

Although an entity commonly uses the exact same @key field(s) across subgraphs, you can alternatively use different @keys with different fields. For example, you can define a Product entity shared between subgraphs, one with sku and upc as its @keys, and the other with only upc as the @key field:

Products subgraph

1
type Product @key(fields: "sku") @key(fields: "upc") {
2
  sku: ID!
3
  upc: String!
4
  name: String!
5
  price: Int
6
}

Inventory subgraph

1
type Product @key(fields: "upc") {
2
  upc: String!
3
  inStock: Boolean!
4
}

To merge entities between subgraphs, the entity must have at least one shared field between subgraphs. For example, operations can't merge the Product entity defined in the following subgraphs because they don't share any fields specified in the @key selection set:

❌

Products subgraph

type Product @key(fields: "sku") {
  sku: ID!
  name: String!
  price: Int
}

Inventory subgraph

type Product @key(fields: "upc") {
  upc: String!
  inStock: Boolean!
}

Operations with differing `@key`s

Differing keys across subgraphs affect which of the entity's fields can be resolved from each subgraph. Requests can resolve fields if there is a traversable path from the root query to the fields.

Take these subgraph schemas as an example:

Products subgraph

type Product @key(fields: "sku") {
  sku: ID!
  upc: String!
  name: String!
  price: Int
}

type Query {
  product(sku: ID!): Product
  products: [Product!]!
}

Inventory subgraph

type Product @key(fields: "upc") {
  upc: String!
  inStock: Boolean!
}

The queries defined in the Products subgraph can always resolve all product fields because the product entity can be joined via the upc field present in both schemas.

On the other hand, queries added to the Inventory subgraph can't resolve fields from the Products subgraph:

Products subgraph

type Product @key(fields: "sku") {
  sku: ID!
  upc: String!
  name: String!
  price: Int
}

Inventory subgraph

type Product @key(fields: "upc") {
  upc: String!
  inStock: Boolean!
}

type Query {
  productsInStock: [Product!]!
}

The productsInStock query can't resolve fields from the Products subgraph since the Products subgraph's Product type definition doesn't include upc as a key field, and sku isn't present in the Inventory subgraph.

If the Products subgraph includes @key(fields: "upc"), all queries from the Inventory subgraph can resolve all product fields:

Products subgraph

1
type Product @key(fields: "sku") @key(fields: "upc") {
2
  sku: ID!
3
  upc: String!
4
  name: String!
5
  price: Int
6
}

Inventory subgraph

1
type Product @key(fields: "upc") {
2
  upc: String!
3
  inStock: Boolean!
4
}
5

6
type Query {
7
  productsInStock: [Product!]!
8
}

Migrating entity fields and root fields

As your supergraph grows, you might want to move parts of one subgraph to another subgraph. This section describes how to migrate entity and root fields safely.

Using the `@override` directive

You can migrate between subgraphs all at once with @override.

💡 TIP

We recommend organizations with an Enterprise license to migrate gradually with progressive @override. See the guide Incremental migration with progressive @override.

Let's say the Payments subgraph defines a Bill entity:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int!
  payment: Payment
}

type Payment {
  # ...
}

As your graph evolves, you decide to add a dedicated Billing subgraph to your supergraph. It makes sense to move billing functionality there, including the amount of a bill. You want the deployed subgraph schemas to look like this:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  payment: Payment
}

type Payment {
  # ...
}

Billing subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int!
}

The @override directive enables you to incrementally migrate between subgraphs with no downtime.

Follow these steps to use the @override directive:

If the @override directive isn't already imported, include it in your schema's @link imports:

Billing subgraph

1
extend schema
2
  @link(url: "https://specs.apollo.dev/federation/v2.7",
3
        import: ["@key", "@shareable", "@override"])

Deploy a new version of the Billing subgraph that both defines and resolves the Bill fields you want to move:
Payments subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int!
payment: Payment
}

type Payment {
# ...
}
Billing subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int! @override(from: "Payments")
}
Applying the @override directive tells the router to resolve the amount field in the Billing subgraph instead of the Payments subgraph.

Update your router's supergraph schema to migrate to the updated Billing subgraph. If you're using managed federation, you do this by publishing the Billing subgraph's schema to GraphOS with rover subgraph publish.

When the router receives its updated supergraph schema, it immediately starts resolving the Bill.amount field from the Billing subgraph while continuing to resolve Bill.payment from the Payments subgraph.

ⓘ NOTE

We can migrate as many entity fields as we want in a single change. To do so, we apply @override to every entity field we want to move. We can even migrate entire entities this way.

Now that Bill.amount is resolved in the Billing subgraph, we can safely remove that field (and its resolver) from the Payments subgraph:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  payment: Payment
}

type Payment {
  # ...
}

Billing subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int! @override(from: "Payments")
}

After making this change, we deploy our updated Payments subgraph and again update our router's supergraph schema.

ⓘ NOTE

Because the router is already ignoring Bill.amount in the Payments subgraph thanks to @override, we can safely publish our updated schema or deploy the subgraph in any order.

Remove the @override directive from the Billing subgraph, because it no longer has any effect:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  payment: Payment
}

type Payment {
  # ...
}

Billing subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int!
}

After we deploy the Billing subgraph and publish this final schema change, we're done. We've migrated Bill.amount to the Billing subgraph with zero downtime.

Incremental migration with progressive `@override`
Since 2.7

You can migrate between subgraphs gradually with progressive @override.

Progressive @override is an Enterprise feature of the Apollo Router and requires an organization with a GraphOS Enterprise plan. If your organization doesn't have an Enterprise plan, you can test it out by signing up for a free Enterprise trial.

Let's say the Payments subgraph defines a Bill entity:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int!
  payment: Payment
}

type Payment {
  # ...
}

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  payment: Payment
}

type Payment {
  # ...
}

Billing subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int!
}

The @override directive enables you to incrementally migrate between subgraphs with no downtime.

Follow these steps to use the @override directive:

If the @override directive isn't already imported, include it in your schema's @link imports:

Billing subgraph

1
extend schema
2
  @link(url: "https://specs.apollo.dev/federation/v2.7",
3
        import: ["@key", "@shareable", "@override"])

Deploy a new version of the Billing subgraph that both defines and resolves the Bill fields you want to move:
Payments subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int!
payment: Payment
}

type Payment {
# ...
}
Billing subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int! @override(from: "Payments", label: "percent(1)")
}
Applying the @override directive tells the router to resolve the amount field in the Billing subgraph instead of the Payments subgraph.
Adding a label argument to the @override directive sets the percentage of traffic to direct to the Billing subgraph. Start with a small percentage. Setting label: "percent(1)" means that 1 percent of the requests for amount are resolved by the Billing subgraph, while the remaining 99 percent are resolved by the Payments subgraph.
Update your router's supergraph schema to begin the migration to the updated Billing subgraph.

When the router receives its updated supergraph schema, it starts resolving the Bill.amount field from the Billing subgraph approximately 1% of the time, while continuing to resolve it from the Payments subgraph the other 99%.

ⓘ NOTE

We can migrate as many entity fields as we want in a single change. To do so, we apply @override to every entity field we want to move. We can even migrate entire entities this way.

Gradually and iteratively increase the percent of traffic directed to the Billing subgraph, update your router's supergraph schema, and validate the performance of the Billing subgraph. Continue until the migration is completed with label: "percent(100)" and all traffic is resolved by the Billing subgraph.
Billing subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int! @override(from: "Payments", label: "percent(100)")
}
Now that Bill.amount is resolved in the Billing subgraph, we can safely remove that field (and its resolver) from the Payments subgraph:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  payment: Payment
}

type Payment {
  # ...
}

Billing subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int! @override(from: "Payments")
}

After making this change, we deploy our updated Payments subgraph and again update our router's supergraph schema.

ⓘ NOTE

Because the router is already ignoring Bill.amount in the Payments subgraph thanks to @override, we can safely publish our updated schema and deploy the subgraph in any order.

Remove the @override directive from the Billing subgraph because it no longer has any effect:

Payments subgraph

type Bill @key(fields: "id") {
  id: ID!
  payment: Payment
}

type Payment {
  # ...
}

Billing subgraph

type Bill @key(fields: "id") {
  id: ID!
  amount: Int!
}

After we deploy the Billing subgraph and publish this final schema change, we're done. We've migrated Bill.amount to the Billing subgraph with zero downtime.

Safe usage of progressive `@override`

When using progressive @override, a single operation can now result in multiple query plans. Query plans are cached by the router, with the set of unique, overridden labels contributing to the cache key.

Prior to progressive @override, only a single query plan was generated for a given operation. With progressive @override, the number of query plans doubles for each unique label in the operation's "path".

A few strategies to mitigate this concern:

Don't leave progressive @override in place indefinitely. Migrate the field and remove the label argument from the @override directive as soon as reasonably possible.
Share labels across fields that are being migrated together. For example, if you are migrating Bill.amount and Bill.payment together, use the same label for both fields. This will ensure that the number of query plans does not increase as a result of the migration.
Use a small, known set of labels (for example percent(5), percent(25), percent(50)).

Customizing progressive `@override` behavior with a feature flag service

Out of the box, the router supports the percent(x) syntax for resolving labels based on a given percentage. Unfortunately, updating this number requires a subgraph publish and router redeploy. To avoid this, you can use a feature flag service to dynamically update the label value.

The router provides an interface for coprocessors and rhai scripts to resolve arbitrary labels. This allows you to dial up or disable a label's rollout status without requiring a subgraph publish. A coprocessor or rhai script that implements this should take the following steps:

Implement the SupergraphService
Inspect the apollo_override::unresolved_labels context key to determine which labels exist in the schema that haven't been resolved by the router.
Resolve the labels using your feature flag service (or any other mechanism).
Add the resolved labels to the apollo_override::labels_to_override context key.

Note: The unresolved labels are all labels in the schema that haven't been resolved by the router. They may not all pertain to the incoming operation. As a final step, the router will filter the resolved labels to only those that are relevant to the operation in order to minimize the set of labels contributing to the query plan cache key. It is expected that a coprocessor or rhai script will resolve all labels in the schema, not just those relevant to the operation.

For an example implementation of a coprocessor that resolves labels using LaunchDarkly, see the example in the router repo.

Optimizing for fewer deploys with manual composition

⚠️ This method requires careful coordination between subgraph and router updates. Without strict control over the order of deployments and schema updates, you might cause an outage. For most use cases, we recommend using the @override method above.

Using @override to migrate entity fields enables us to migrate fields incrementally with zero downtime. However, doing so requires three separate schema publishes. If you're using manual composition, each schema change requires redeploying your router. With careful coordination, we can perform the same migration with only a single router redeploy.

In the Billing subgraph, define the Bill entity, along with its corresponding resolvers. These new resolvers should behave identically to the Payment subgraph resolvers they're replacing.
Payments subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int!
payment: Payment
}

type Payment {
# ...
}
Billing subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int!
}
Deploy the updated Billing subgraph to your environment, but do not publish the updated schema yet.
- At this point, the Billing subgraph can successfully resolve Bill objects, but the router doesn't know this yet because its supergraph schema hasn't been updated. Publishing the schema would cause a composition error.
In the Payments subgraph, remove the migrated fields from the Bill entity and their associated resolvers (do not deploy this change yet):
Payments subgraph
type Bill @key(fields: "id") {
id: ID!
payment: Payment
}

type Payment {
# ...
}
Billing subgraph
type Bill @key(fields: "id") {
id: ID!
amount: Int!
}
Compose an updated supergraph schema with your usual configuration using rover supergraph compose.
- This updated supergraph schema indicates that the Billing subgraph resolves Bill.amount, and the Payments subgraph doesn't.
Assuming CI completes successfully, deploy an updated version of your router with the new supergraph schema.
- When this deployment completes, the router begins resolving Bill fields in the Billing subgraph instead of the Payments subgraph.
⚠️ While your new router instances are deploying, you will probably have active router instances resolving the Bill.amount field in two different ways (with older instances still resolving it from Payments). It's important that the two subgraphs resolve the field in exactly the same way, or your clients might see inconsistent data during this rollover.
Deploy the updated version of your Payments subgraph without the migrated field.
- At this point it's safe to remove this definition, because your router instances are using the Billing subgraph exclusively.

We're done! The migrated fields have been moved to a new subgraph, and we only redeployed our router once.

Contributing computed entity fields

You can define fields of an entity that are computed based on the values of other entity fields that are resolved by a different subgraph.

For example, this Shipping subgraph adds a shippingEstimate field to the Product entity. This field is calculated based on the product's size and weight, which are defined in the Products subgraph:

Shipping subgraph

1
type Product @key(fields: "id") {
2
  id: ID!
3
  size: Int @external
4
  weight: Int @external
5
  shippingEstimate: String @requires(fields: "size weight")
6
}

As shown, you use the @requires directive to indicate which fields (and subfields) from other subgraphs are required. You also need to define the required fields and apply the @external directive to them. This directive tells the router, "This subgraph knows that these fields exist, but it can't resolve them itself."

In the above example, if a query requests a product's shippingEstimate, the router does the following, in order:

It queries the Products subgraph for the product's size and weight.
It queries the Shipping subgraph for the product's shippingEstimate. The size and weight are included in the Product object passed to the resolver for shippingEstimate:

1
{
2
  Product: {
3
    shippingEstimate(product) {
4
      return computeShippingEstimate(product.id, product.size, product.weight);
5
    }
6
  }
7
}

Using `@requires` with object subfields

If a computed field @requires a field that returns an object type, you also specify which subfields of that object are required. You list those subfields with the following syntax:

Shipping subgraph

1
type Product @key(fields: "id") {
2
  id: ID!
3
  dimensions: ProductDimensions @external
4
  shippingEstimate: String @requires(fields: "dimensions { size weight }")
5
}

In this modification of the previous example, size and weight are now subfields of a ProductDimensions object. Note that the ProductDimensions type must be defined in both the Products and Shipping subgraphs for this to be valid.

Using `@requires` with fields that take arguments

This functionality was introduced in Federation v2.1.2.

The @requires directive can include fields that take arguments, like so:

Shipping subgraph

1
type Product @key(fields: "id") {
2
  id: ID!
3
  weight(units: String): Int @external
4
  shippingEstimate: String @requires(fields: "weight(units:\"KILOGRAMS\")")
5
}

The router provides the specified values in its query to whichever subgraph defines the required field.
Each specified argument value is static (i.e., the router always provides the same value).
You can omit values for nullable arguments. You must provide values for non-nullable arguments.
If you define your subgraph schema in an SDL file (instead of programmatically), you must escape quotes for string and enum values with backslashes (as shown above).

Resolving another subgraph's field

By default, exactly one subgraph is responsible for resolving each field in your supergraph schema (with important exceptions, like entity @key fields). But sometimes, multiple subgraphs are able to resolve a particular entity field, because all of those subgraphs have access to a particular data store. For example, an Inventory subgraph and a Products subgraph might both have access to the database that stores all product-related data.

You can enable multiple subgraphs to resolve a particular entity field. This is a completely optional optimization. When the router plans a query's execution, it looks at which fields are available from each subgraph. It can then attempt to optimize performance by executing the query across the fewest subgraphs needed to access all required fields.

You achieve this with one of the following directives:

@shareable
@provides

Which directive you use depends on the following logic:

If you aren't sure whether your subgraph can always resolve a field, see Using @provides for an example of a subgraph that can't.

Ensure resolver consistency

If multiple subgraphs can resolve a field, make sure each subgraph's resolver for that field behaves identically. Otherwise, queries might return inconsistent results to clients depending on which subgraph resolves the field.

This is especially important to keep in mind when making changes to an existing resolver. If you don't make the resolver changes to each subgraph simultaneously, clients might observe inconsistent results.

Common inconsistent resolver behaviors to look out for include:

Returning a different default value
Throwing different errors in the same scenario

Using `@shareable`

⚠️ Before using @shareable, see Ensure resolver consistency.

The @shareable directive indicates that a particular field can be resolved by more than one subgraph:

Products subgraph

type Product @key(fields: "id") {
  id: ID!
  name: String! @shareable
  price: Int
}

Inventory subgraph

type Product @key(fields: "id") {
  id: ID!
  name: String! @shareable
  inStock: Boolean!
}

In this example, both the Products and Inventory subgraphs can resolve Product.name. This means that a query that includes Product.name might be resolvable by fetching from fewer total subgraphs.

If a field is marked @shareable in any subgraph, it must be marked @shareable or @external in every subgraph that defines it. Otherwise, composition fails.

Using `@provides`

⚠️ Before using @provides, see Ensure resolver consistency.

The @provides directive indicates that a particular field can be resolved by a subgraph at a particular query path. Let's look at an example.

Here, our Products subgraph defines a Product.name field and marks it @shareable (this means other subgraphs are allowed to resolve it):

Products subgraph

1
type Product @key(fields: "id") {
2
  id: ID!
3
  name: String! @shareable
4
  price: Int
5
}

Meanwhile, our Inventory subgraph can also resolve a product's name, but only when that product is part of an InStockCount:

Inventory subgraph

1
type InStockCount {
2
  product: Product! @provides(fields: "name")
3
  quantity: Int!
4
}
5

6
type Product @key(fields: "id") {
7
  id: ID!
8
  name: String! @external
9
  inStock: Boolean!
10
}

Here we're using two directives in combination: @provides and @external.

The @provides directive tells the router, "This subgraph can resolve the name of any Product object returned by InStockCount.product."
The @external directive tells the router, "This subgraph can't resolve the name of a Product object, except wherever indicated by @provides."

Rules for using `@provides`

If a subgraph @provides a field that it can't always resolve, the subgraph must mark that field as @external and must not mark it as @shareable.
- Remember, a @shareable field can always be resolved by a particular subgraph, which removes the need for @provides.
To include a field in a @provides directive, that field must be marked as @shareable or @external in every subgraph that defines it.

Violating any of these rules causes composition to fail.

Handling the N+1 problem

Most subgraph implementations use reference resolvers (sometimes known as entity resolvers) to handle the Query._entities field ergonomically. A reference resolver is passed a single key and returns the entity object that corresponds to that key.

Although this pattern is straightforward, it can diminish performance when a client operation requests fields from many entities. To illustrate this, let's revisit an earlier example:

1
query GetReviewsWithProducts {
2
  latestReviews { # Defined in Reviews
3
    score
4
    product {
5
      id
6
      price # ⚠️ NOT defined in Reviews!
7
    }
8
  }
9
}

As mentioned in The query plan, the router executes two queries on its subgraphs to resolve the above operation:

It queries the Reviews subgraph to fetch all fields except Product.price.
It queries the Products subgraph to fetch the price of each Product entity.

In the Products subgraph, the reference resolver for Product doesn't take a list of keys, but rather a single key. Therefore, the subgraph library calls the reference resolver once for each key:

resolvers.js

1
// Products subgraph
2
const resolvers = {
3
  Product: {
4
    __resolveReference(productRepresentation) {
5
      return fetchProductByID(productRepresentation.id);
6
    }
7
  },
8
  // ...other resolvers...
9
}

A basic implementation of the fetchProductByID function might make a database call each time it's called. If we need to resolve Product.price for N different products, this results in N database calls. These calls are made in addition to the call made by the Reviews subgraph to fetch the initial list of reviews (and the id of each product). This is where the "N+1" problem gets its name. If not prevented, this problem can cause performance problems or even enable denial-of-service attacks.

This problem is not limited to reference resolvers! In fact, it can occur with any resolver that fetches from a data store. To handle this problem, we strongly recommend using the dataloader pattern. Nearly every GraphQL server library provides a dataloader implementation, and you should use it in every resolver. This is true even for resolvers that aren't for entities and that don't return a list. These resolvers can still cause N+1 issues via batched requests.

Entities (basics)

Entity interfaces