A walk in GraphQL — Introduction

8 articles to learn GraphQL incrementally, keeping the implementation agnostic spirit of the SDL

Javier Valderrama
12 min readMay 20, 2021

Preface

A lot of information has been written about GraphQL since it was unleashed to the world by Facebook in 2015. As the curious nerds we are, we tend to embrace the self-taught approach first, which implies research, reading documents and tutorials, and more documents and tutorials, and trying/error in a continuous loop, until we feel confident with the topic.

The intention of this road map is to share with you a learning experience (one of many paths you can take to grok the language) and to draw you to the highest quality information (which we won’t be ever able to write in a better way) keeping you on the implementation agnostic spirit of the language, providing abstract examples as well as practical exercises (from #Day 1 on) in different languages so you can understand the correlations and differences without losing focus on the main topic: GraphQL.

You’ll walk through this starting from the language and the API definition on the backend, gradually increasing the complexity as the walk develops; it might contain new examples on the future, or more “days” including client-side, testing or architecture specific topic, or we might create another walk for them, we don’t really know.

Hopefully your walk will be much easier than ours!

TOC

  • What’s a graph?
  • What’s GraphQL and what’s the graph part all about
  • GraphQL vs RESTful
  • Schema Basics
  • SDL — Schema Definition Language
  • Named Types
  • Input and Output Types
  • Lists and Non-nullable
  • Root operation Types
  • Query
  • Mutation
  • Subscription
  • Resolvers

What’s a graph?

Without being too strict or too loose, we can say a graph is an abstract model to describe at least a pair of objects and their relationship.

In mathematics, and more specifically in graph theory, a graph is a structure amounting to a set of objects in which some pairs of the objects are in some sense “related”. The objects correspond to mathematical abstractions called vertices (also called nodes or points) and each of the related pairs of vertices is called an edge (also called link or line). Typically, a graph is depicted in diagrammatic form as a set of dots or circles for the vertices, joined by lines or curves for the edges. Graphs are one of the objects of study in discrete mathematics.

The edges may be directed or undirected. For example, if the vertices represent people at a party, and there is an edge between two people if they shake hands, then this graph is undirected because any person A can shake hands with a person B only if B also shakes hands with A. In contrast, if any edge from a person A to a person B corresponds to A owes money to B, then this graph is directed, because owing money is not necessarily reciprocated. The former type of graph is called an undirected graph while the latter type of graph is called a directed graph.

Source: Wikipedia — Graph (discrete mathematics)

A graph with three vertices and three edges.
A directed graph with three vertices and four directed edges (the double arrow represents an edge in each direction).

What’s GraphQL and what’s the graph part all about

Let’s try to go from the surface down to the rabbit hole.

The interface

Since repeating again and again the “unified interface to access data from different sources” thing (we’re polluting Google results with that phrase, lame) feels kinda dumb at this point; let’s try another analogy.

Imagine a shape sorter toy, that’s your contract.

  1. Anything going through (in/out) the shape sorter MUST match one of the defined shapes.
  2. Everything is described in terms of a shape, a shape’s property is a shape as well.
  3. It doesn’t care about how you get the blocks, granted #1
  4. It doesn’t care how you compose the blocks, granted #1

Without going in details with things like partial matches, etc., the above statement makes clear that.

  • From the client side you’ll know upfront the shapes you can request and retrieve.
  • From the server side you’ll know upfront the shapes you will be requested and have to deliver.

The relationships (graph)

Here it comes the graph thing.

If you go to the Thinking in Graphs section at graphql.org you’ll see the phrase “It’s Graphs All the Way Down” with a citation to the “Turtles all the way down” expression referring to the infinite regress problem. Despite the philosophical implications that you can explore if you want, what we can derive from that phrase is that it’s all about graphs!

Let’s see a small example:

Person is clearly a vertex or node, which has many properties related to it, hence we can describe the properties as nodes and the relationships as edges!!! All the way down! until no edges are found and all the data structure is returned.

Also did you noticed that friend property ( only one possible friend? that's cruel ) can hold a Person?

That means:

Got the point?
We could perform the following query:

and obtain

All the way down until the max depth defined on the server is reached (that’d be the only limitation)

Let’s take a look at the best explanation about this topic we’ve found: GraphQL Concepts Visualized by Dhaivat Pandya

GraphQL vs RESTful

There are a lot of documents on the web describing the “differences” between them, some are very precise, detailed and descriptive, but a lot are misleading or inaccurate (that’s why the quotes). It’s extremely important, before moving forward, to make some of those statements clear.

It:

  • WON’T lower down your database load, it doesn’t care how/where you persist your data
  • WON’T lower down the network traffic between your graphQL server and your persistence infrastructure per-se, that’s business logic and not graphQL … but
  • WILL tend to lower down the network traffic between the clients and the graphQL server
  • WON’T solve your messy/inconsistent data structure or your legacy mixed persistence infrastructure … but
  • WILL help you provide a consistent, unified and scalable contract for your data structure from graphQL to the clients.

That said, we’d like to share with you some enlightening insights from a Facebook Engineering platform’s post, see an extract below.

We evaluated our options for delivering News Feed data to our mobile apps, including RESTful server resources and FQL tables (Facebook’s SQL-like API). We were frustrated with the differences between the data we wanted to use in our apps and the server queries they required. We don’t think of data in terms of resource URLs, secondary keys, or join tables; we think about it in terms of a graph of objects and the models we ultimately use in our apps like NSObjects or JSON.

There was also a considerable amount of code to write on both the server to prepare the data and on the client to parse it. This frustration inspired a few of us to start the project that ultimately became GraphQL. GraphQL was our opportunity to rethink mobile app data-fetching from the perspective of product designers and developers. It moved the focus of development to the client apps, where designers and developers spend their time and attention.

Source: GraphQL: A data query language

It’s clear that the perspective shift from the persistence to the consumption point of view is at the GraphQL DNA, and that’s VERY CLEAR in the spec!

GraphQL has a number of design principles:

Hierarchical: Most product development today involves the creation and manipulation of view hierarchies. To achieve congruence with the structure of these applications, a GraphQL query itself is structured hierarchically. The query is shaped just like the data it returns. It is a natural way for clients to describe data requirements.

Product‐centric: GraphQL is unapologetically driven by the requirements of views and the front‐end engineers that write them. GraphQL starts with their way of thinking and requirements and builds the language and runtime necessary to enable that.

Strong‐typing: Every GraphQL server defines an application‐specific type system. Queries are executed within the context of that type system. Given a query, tools can ensure that the query is both syntactically correct and valid within the GraphQL type system before execution, i.e. at development time, and the server can make certain guarantees about the shape and nature of the response.

Client‐specified queries: Through its type system, a GraphQL server publishes the capabilities that its clients are allowed to consume. It is the client that is responsible for specifying exactly how it will consume those published capabilities. These queries are specified at field‐level granularity. In the majority of client‐server applications written without GraphQL, the server determines the data returned in its various scripted endpoints. A GraphQL query, on the other hand, returns exactly what a client asks for and no more.

Introspective: GraphQL is introspective. A GraphQL server’s type system must be queryable by the GraphQL language itself, as will be described in this specification. GraphQL introspection serves as a powerful platform for building common tools and client software libraries.

Source: GraphQL spec (June 2018 Edition)

And even tough it tends to be agnostic/silent regarding many aspects it’s still strongly opinionated regarding best practices as you can see here:

Schema Basics

SDL — Schema Definition Language

Even though GraphQL was internally used since 2012 and publicly released in 2015 we had to wait until 2018 for the Schema Definition Language (SDL) to became part of the specification. ( Source: Wikipedia GraphQL)

The Schema is a text document that follows the SDL syntax defined on the GraphQL specification, essentially a contract that declares, in the form of Types, the shape of your data graph and the operations you can perform with it.

For the time being (March, 2020) type definitions belong to one of the following categories.

Named Types

  • Scalar Type Scalar types represent primitive leaf values in a GraphQL type system. GraphQL responses take the form of a hierarchical tree; the leaves on these trees are GraphQL scalars.
  • Int Signed 32‐bit integer
  • Float Signed double-precision floating-point value
  • String UTF‐8 character sequence
  • Boolean true or false
  • ID (serialized as a String): A unique identifier that’s often used to refetch an object or as the key for a cache. Although it’s serialized as a String, an ID is not intended to be human‐readable.
  • Enum Type GraphQL Enum types, like scalar types, also represent leaf values in a GraphQL type system. However, Enum types describe the set of possible values.
  • Object Type While Scalar types describe the leaf values of these hierarchical queries, Objects describe the intermediate levels.
  • Input Object Type A GraphQL Input Object defines a set of input fields; the input fields are either scalars, enums, or other input objects. This allows arguments to accept arbitrarily complex structs.
  • Interface Type (abstract type) GraphQL interfaces represent a list of named fields and their arguments. GraphQL objects can then implement these interfaces which requires that the object type will define all fields defined by those interfaces.
  • Union Type (abstract type) GraphQL Unions represent an object that could be one of a list of GraphQL Object types, but provides for no guaranteed fields between those types. They also differ from interfaces in that Object types declare what interfaces they implement, but are not aware of what unions contain them.

Input and Output Types

Types are used throughout GraphQL to describe both the values accepted as input to arguments and variables as well as the values output by fields. These two uses categorize types as input types and output types. Some kinds of types, like Scalar and Enum types, can be used as both input types and output types; other kinds types can only be used in one or the other. Input Object types can only be used as input types. Object, Interface, and Union types can only be used as output types. Lists and Non‐Null types may be used as input types or output types depending on how the wrapped type may be used.

Lists

A GraphQL list is a special collection type which declares the type of each item in the List (referred to as the item type of the list). List values are serialized as ordered lists, where each item in the list is serialized as per the item type. To denote that a field uses a List type the item type is wrapped in square brackets like this: pets: [Pet].

Source: GraphQL Spec (June 2018)

Non-nullable

By default, all types in GraphQL are nullable; the null value is a valid response for all of the above types. To declare a type that disallows null, the GraphQL Non‐Null type can be used. This type wraps an underlying type, and this type acts identically to that wrapped type, with the exception that null is not a valid response for the wrapping type. A trailing exclamation mark is used to denote a field that uses a Non‐Null type like this: name: String!.

Source: GraphQL Spec (June 2018)

Combining List and Non-Null

The List and Non‐Null wrapping types can compose, representing more complex types. The rules for result coercion and input coercion of Lists and Non‐Null types apply in a recursive fashion.

Source: GraphQL Spec (June 2018)

Some examples here:

friends is non-nullable but may contain no Person entries (empty list)

friends is nullable but it must contain only Person entries when it's not empty

friends is non-nullable and all entries must be a Person type

Root Operation Types

A schema defines the initial root operation type for each kind of operation it supports: query, mutation, and subscription; this determines the place in the type system where those operations begin.

The query root operation type must be provided and must be an Object type.

The mutation root operation type is optional; if it is not provided, the service does not support mutations. If it is provided, it must be an Object type.

Similarly, the subscription root operation type is also optional; if it is not provided, the service does not support subscriptions. If it is provided, it must be an Object type.

Source: GraphQL Spec (June 2018 )

Despite the fact that they act as the entry points into the schema, they are just another Object Type, meaning their fields work the same way.

Query

The contract that defines how you can ask for the data and how it’ll be returned. (a read‐only fetch)

Mutation

The contract that defines how you can change the data and how it’ll be returned. (a write followed by a fetch)

Subscription

If the operation is a subscription, the result is an event stream called the “Response Stream” where each event in the event stream is the result of executing the operation for each new event on an underlying “Source Stream”.

Extending Types

At a certain point you might want to modularize your schema, and start separating the type definitions on a single document or dividing it into multiple documents. Since Type definitions are unique you have to extend them or use the schema compositions tools available on your runtime.

Resolvers

So far we’ve seen the “what” (type declarations on SDL form) but nothing about the “how”.

GraphQL doesn’t know how to turn an operation (query, mutation, subscription) into data unless you define those instructions by providing a set of functions or “resolver map” matching the same shape of the data specified in your schema. Those functions cannot be included on the schema, they have to be added separately and it will depend on your server application, the language you’ll use to define them, but they all MUST follow the same structure.

The simplest version of a set of resolvers for the previous examples might look like this (in javascript):

We’ll get into details on the next chapter, but for now it’s important to know that once you define your top-level resolvers (they have only the Root Operation above on the hierarchy), GraphQL will fall back to the default resolver (whenever no field-level resolver is defined ) and ultimately fail if the operation cannot be completed … and yes, resolvers can be asynchronous.

A walk in GraphQL Series at Medium

Github Pages

A Walk in GraphQL

Collaborators

  • Ezequiel Alvarez — @ealvarezk
    Reviewer
  • Ezequiel Tejerina — @quequitejerina
    Python exercises contributor and reviewer
  • Franco Gribaudo — @fgriba
    Java exercises contributor and reviewer
  • Cristian Buffa — @cristianbuffa & José Ignacio Aguilera — @jiaguilera
    NetCore exercises contributors and reviewers
  • Javier Valderrama — @Jaxolotl
    Author, JavaScript exercise contributor and reviewer

Other Learning Resources

--

--

Javier Valderrama
Javier Valderrama

Written by Javier Valderrama

Engineering manager, Lead Software engineer, passionate educator and community evangelist.