hypermedia-systems/book/CH11_JSONDataAPIs.adoc
2022-11-19 10:15:56 -07:00

519 lines
29 KiB
Plaintext
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

= JSON Data APIs & Hypermedia Driven Applications
:chapter: 11
:url: ./json-data-apis/
This chapter covers
* Contrasting JSON-based Data APIs with hypermedia APIs
* Adding a Data API to our application
* Adding hypermedia controls to our JSON Data API
[partintro]
== JSON Data APIs
In this book we have been focusing on using hypermedia to build Hypermedia Driven Applications. In doing so we are
following the original networking architecture of the web, and building a REST-ful system, in the original sense
of that term.
However, today, we should acknowledge that many web applications are _not_ built using this approach. Instead, they use a
Single Page Application front end library such as React to build their application, and they interact with the server
via a JSON API. This JSON API almost never uses any hypermedia concepts. Rather JSON APIs tend to be a _Data API_, that
is, an API that simply returns structured domain data to the client without any hypermedia control information. The client
itself must know how to interpret the JSON Data: what end points are associated with the data, how certain fiels should
be displayed, and so on.
Now, believe it or not, we _have_ been creating an API for our web application. This may sound confusing to you!
An API? We have just been creating a web application, with paths that return HTML. How is that an API?
It turns out that our web application is, indeed, providing an API. It just happens to be an API that a _hypermedia client_,
that is, a browser, understands. We are building an API for the browser to interact with over HTTP, and, thanks to the
magic of HTML and hypermedia, the browser doesn't need to know anything about our hypermedia API beyond an entry point
URL: all the actions and display information comes, self-contained, within the HTML responses!
Building web applications like this is so natural and simple that you might not think of it as an API at all, but I
assure you, it is!
== Hypermedia APIs & JSON Data APIs
OK, wo, we have a hypermedia API for Contact.app. Should we include a Data API as well?
Sure! The existence of a hypermedia API _in no way means_ that you can't _also_ have a Data API. In fact, this is a
common situation in traditional web applications: there is the "web application" that is entered through that entry point
URL, say `https://mywebapp.example.com/`. And there is also a _separate_ JSON API that is accessible through another
URL, perhaps `https://api.mywebapp.example.com/v1`.
This is a perfectly reasonable way to split up the hypermedia interface to your application and the Data API you provide
to other, non-hypermedia clients.
Now, why would you want to include a Data API along with a hypermedia API? Well, because non-hypermedia clients want to
interact with your application as well, of course! For example:
* Perhaps you have a mobile application that isn't built using HyperView. That application will need to interact with
your server somehow, and using the existing HTML API would almost certainly be a poor fit! You want programmatic
access to your system via a Data API, and JSON is a natural choice for this.
* Perhaps you have automated script that needs to interact with the system on a regular basis. For example, maybe we
have a bulk-import job that runs nightly, and needs to import/sync thousands of contacts. While it would be possible
to script this agains the HTML API, it would also be annoying: parsing HTML in scripts is error prone and tedious. It
would be better to have a simple JSON API for this use case.
* Perhaps there are 3rd party clients who wish to integrate with your system's data in some way. Maybe a partner
wants to synchronize data nightly. As with the bulk-import example, this isn't a great use case for an HTML-based API,
and it would make more sense to provide something more amenable to scripting.
For all of these use cases, a JSON Data API makes sense: in each case the API is not being consumed by a hypermedia client,
so presenting am HTML-based hypermedia API would be inefficient and complicated. A simple JSON Data API fits the bill for what
we want and, as always, we recommend using the right tool for the job.
."What!?! You Want Me To Parse HTML!?!"
****
A confusion we often run into in online discussions is that, when we advocate a hypermedia approach to building web
applications, is that people think we mean that they should parse the HTML responses from the server, and then dump the
data into their SPA framework or mobile applications.
This is, of course, silly.
What we mean, instead, is that you should consider using a hypermedia API _with a hypermedia
client_, like the browser, interpreting the hypermedia response itself and presenting it to the user. A hypermedia API
can't simply be welded on top of an existing SPA approach. It requires a sophisticated hypermedia client such as
the browser to be consumed effectively.
If you are writing code to tease apart your hypermedia only to then treat as data to feed into a client-side model ,
you are probably doing it wrong!
****
=== Differences Between Hypermedia APIs & Data APIs
So, OK, let's accept for a moment that we _are_ going to have a Data API for our application, in addition to our
hypermedia API. At this point, some developers may be wondering: whyhave both? Why not have a single API, the JSON Data API, and have multiple clients use this one API to communicate
with it? Isn't it redundant to have both types of APIs for our application?
It's a reasonable point: we do advocate having multiple APIs to your web application if necessary and, yes, this may
lead to some redundancy in code. However, there are distinct advantages to both sorts of API and, even more so,
distinct requirements for both sorts of APIs. By supporting both of these types of APIs separately you can get the
strengths of both, while keeping their varying styles of code and infrastructure needs cleanly split out.
Let's contrast the needs of JSON APIs with Hypermedia APIs:
[cols="a,a"]
|===
|JSON API Needs |Hypermedia API
|
* It must remain stable over time: you cannot change the API willy nilly or you risk breaking clients that use the API
and expect certain end points to behave in certain ways
* It must be versioned: related to the first point, when you do make a major change, you need to version the API so
that clients that are using the old API continues to work
* It should be rate limited: since data APIs are often used by other clients, not just your own internal web application,
requests should be rate limited, often by user, in order to avoid a single client overloading the system
* It should be a general API: since the API is for _all_ clients, not just for your web application, you should avoid
specialized end points that are driven by your own application needs. Instead, the API should be general and expressive
enough to satisfy as many potential client needs as possible.
* Authentication for these sorts of API is typically token based, which we will discuss in more detail later
|
* There is no need to remain stable over time: all URLs are discovered via HTML responses, so you can be much more aggressive
in changing the shape of a hypermedia API
* This means that versioning is not an issue, another strength of the hypermedia approach
* Rate limiting probably isn't as important beyond the prevention of Distributed Denial of Service (DDoS) attacks
* The API can be _very specific_ to your application needs: since it is designed only for your particular web application,
and since the API is discovered through hypermedia, you can add and remove highly tuned end points for specific
features or optimization needs in your application
* Authentication is typically managed through a session cookie established by a login page
|===
These two different types of APIs have different strengths and needs, so it makes sense to use both. The hypermedia
approach can be used for your web application, allowing you to specialize the API for the "shape"
of your application. The Data API approach can be used for other, non-hypermedia clients like mobile, integration
partners, etc.
Note in particular that, by splitting these two APIs apart from one another, you reduce the pressure that running your web
application through your general Data API produces to be constantly changing the API to address application needs. Rather
than being thrashed around with every feature change, your Data API can focus on remaining stable and reliable.
This is the core strength splitting your Data API from your Hypermedia API, in our opinion.
.JSON Data APIs vs JSON "REST" APIs
****
Unfortunately, today, for historical reasons, what we are calling JSON Data APIs are often referred to
"REST APIs" in the industry. This is ironic, because, by any reasonable reading of Roy Fielding's work defining what REST
means, the vast majority of JSON APIs are _not_ REST-ful. Not even close!
[quote, Roy Fielding, https://roy.gbiv.com/untangled/2008/rest-apis-must-be-hypertext-driven]
____
I am getting frustrated by the number of people calling any HTTP-based interface a REST API. Todays example is the
SocialSite REST API. That is RPC. It screams RPC. There is so much coupling on display that it should be given an X rating.
What needs to be done to make the REST architectural style clear on the notion that hypertext is a constraint? In other
words, if the engine of application state (and hence the API) is not being driven by hypertext, then it cannot be RESTful
and cannot be a REST API. Period. Is there some broken manual somewhere that needs to be fixed?
____
The story of how "REST API" came to mean "JSON APIs" in the industry is a long and sordid long one, and beyond the
scope of this book. However, if you are interested, you can refer to an essay entitled "How Did REST Come To Mean The Opposite of
REST?" on the htmx website:
https://htmx.org/essays/how-did-rest-come-to-mean-the-opposite-of-rest/
In this book we will use the term "Data API" to describe these JSON APIs, while acknowledging that many people
in the industry will continue to call them "REST APIs" for the foreseeable future.
****
== Adding a JSON Data API To Contact.app
Alright, so how are we going to add a JSON Data API to our application? One approach, popularized by the Ruby on Rails
web framework, is to use the same URL endpoints as your hypermedia application, but use the HTTP `Accept` header to
determine if the client wants a JSON representation or an HTML representation. The HTTP `Accept` header allows a client
to specify what sort of Multipurpose Internet Mail Extensions (MIME) types, that is file types, it wants back from the
server: JSON, HTML, text and so on.
So, if the client wanted a JSON representation of all contacts, they might issue a `GET` request that looks like this:
[, reftext={chapter}.{counter:listing}]
.A Request for a JSON Representation of All Contacts
[source, http request]
----
Accept: application/json
GET /contacts
----
If we adopted this pattern then our request handler for `/contacts/ would need to be updated to inspect this header and,
depending on the value, return a JSON rather than HTML representation for the contacts. Ruby on Rails has support for
this pattern baked into the language, making it very easy to switch on the requested MIME type.
Unfortunately, our experience with this pattern has not been great, for reasons that should be clear given the
differences we outlined between Data and hypermedia APIs: they have different needs and often take on very different
"shapes", and trying to pound them into the same set of URLs ends up creating a lot of tension in the application code.
Given the different needs of the two APIs and our experience managing multiple APIs like this, we think separating the two
from one another, and, therefore, breaking the JSON Data API out to its own set of URLs is the right choice. This will allow us to evolve the two APIs separately from one another, and give us room
to improve each independently, in a manner consistent with their own individual strengths.
=== Picking a Root URL For Our API
Given that we are going to split our JSON Data API routes out from our regular hypermedia routes, where should we place
them? One important consideration here is that we want to make sure that we can version our API cleanly in some way,
regardless of the pattern we choose. Looking around, a lot of places end up using a sub-domain for their apis, something
like `https://api.mywebapp.example.com` and, in fact, often encode versioning in the subdomain:
`https://vi.api.mywebapp.example.com`.
While this makes sense for large companies, it seems like a bit of overkill for our modest little Contact.app. Rather
than using sub-domains, which are a pain for local development, we will use sub-paths within the existing application:
* We will use `/api` as the root for our Data API functionality
* We will use `/api/v1` as the entry point for version 1 of our Data API
If and when we decide to bump the API version, we can move to `/api/v2` and so on.
This approach isn't perfect, of course, but it will work for our simple application and can be adapted to a subdomain
approach or various other methods at a later point, when our Contact.app has taken over the internet and we can afford
a large team of API developers. :)
=== Our First JSON Endpoint: Listing All Contacts
Let's add our first Data API End point. It will handle an HTTP `GET` request to `/api/v1/contacts`, and return
a JSON list of all contacts in the system. In some ways it will look quite a bit like our initial code for the
hypermedia route `/contacts`: we will load all the contacts from the contacts database and then render some text
as a response.
We are also going to take advantage of a nice feature of Flask: if you simply return an object from a handler, it will
serialized (that is, convert) that object into a JSON response. This makes it very easy to build simple JSON APIs
in flask!
Here is our code:
.A JSON Data API To Return All Contacts
[source, python]
----
@app.route("/api/v1/contacts", methods=["GET"]) <1>
def json_contacts():
contacts_set = Contact.all()
contacts_dicts = [c.__dict__ for c in contacts_set] <2>
return {"contacts": contacts_dicts} <3>
----
<1> JSON API gets its own path, starting with `/api`
<2> Convert the contacts array into an array of simple dictionary (map) objects
<3> Return a dictionary that contains a `contacts` property of all the contacts
This Python code might look a little foreign to you if you are not a Python developer, but all we are doing is converting
our contacts into an array of simple name/value pairs and returning that array in an enclosing object as the `contacts`
property. This object will be serialized into a JSON response automatically by Flask.
With this in place, if we make an HTTP `GET` request to `/api/v1/contacts`, we will see a response that looks something
like this:
.Some Sample Data From Our API
[source, json]
----
{
"contacts": [
{
"email": "carson@example.comz",
"errors": {},
"first": "Carson",
"id": 2,
"last": "Gross",
"phone": "123-456-7890"
},
{
"email": "joe@example2.com",
"errors": {},
"first": "",
"id": 3,
"last": "",
"phone": ""
},
...
----
So, you can see, we now have a way to get a relatively simple JSON representation of our contacts via an HTTP request.
Not perfect, but good enough for the purposes of this book! It's certainly good enough to write some basic automated
scripts against. For example:
* You could move your contacts to another system on a nightly basis
* You could back your contacts up to a local file
* You could automate an email blast to your contacts
Having this small JSON Data API opens up a lot of automation possibilities that would be messier to achieve with our existing
hypermedia API.
=== Adding Contacts
Let's move on the next piece of functionality: adding a new contact to the system. Once again, our code is going
to look similar in some ways to the code that we wrote for our normal web application. However, here we are also
going to see the JSON API and the hypermedia API for our web application begin to obviously diverge.
In the web application, we needed a separate path, `/contacts/new` to host the HTML form for creating a new contact. In
the web application we made the decision to issue a `POST` to that same path to keep things consistent.
In the case of the JSON API, there is no such path needed: the JSON API "just is": it doesn't need to provide any
hypermedia representation for creating a new contact. You simply know where to issue a `POST` to to create a contact,
likely through some provided documentation about the API, and that's it.
Because of that fact, we can put the "create" handler on the same path as the "list" handler: `/api/v1/contacts`, but
have it respond only to HTTP `POST` requests.
The code here is relatively straight forward: populate a new contact with the information from the `POST` request,
attempt to save it and, if it is not successful, show some error messages. Here is the code:
.Adding Contacts With Our JSON API
[source, python]
----
@app.route("/api/v1/contacts", methods=["POST"]) <1>
def json_contacts_new():
c = Contact(None, request.form.get('first_name'), request.form.get('last_name'), request.form.get('phone'),
request.form.get('email')) <2>
if c.save(): <3>
return c.__dict__
else:
return {"errors": c.errors}, 400 <4>
----
<1> This handler is on the same path as the first one for our JSON API, but handles `POST` requests
<2> We create a new Contact based on values submitted with the request
<3> We attempt to save the contact and, if successful, render it as a JSON object
<4> If the save is not successful, we render an object showing the errors, with a response code of `400 (Bad Request)`
In some ways similar to our `contacts_new()` handler from our web application (we are creating the contact and attempting
to save it) but in other ways very different:
* There is no redirection happening here on a successful creation, because we are not dealing with a hypermedia client
like the browser
* In the case of a bad request, we simply return an error response code, `400 (Bad Request)`. This is in contrast with
the web application, where we simply re-render the form with error messages in it.
It is these sorts of differences that, over time, build up and make the idea of keeping your JSON and hypermedia APIs
on the same set of URLs less and less appealing.
=== Viewing Contact Details
Next let's make it possible for a JSON API client to download the details for a single client. We will naturally use an
HTTP `GET` for this functionality and we will follow the convention we established for our regular web application, and
put the path at `/api/v1/contacts/<contact id>`, so, for example, if you want to see the details of the contact with the
id `42`, you would issue an HTTP `GET` to `/api/v1/contacts/42`.
This code is quite simple:
.Getting the Details of a Contact in JSON
[source, python]
----
@app.route("/api/v1/contacts/<contact_id>", methods=["GET"]) <1>
def json_contacts_view(contact_id=0):
contact = Contact.find(contact_id) <2>
return contact.__dict__ <3>
----
<1> Add a new `GET` route at the path we want to use for viewing contact details
<2> Look the contact up via the id passed in through the path
<3> Convert the contact to a dictionary, so it can be rendered as JSON response
Nothing too complicated: we look the contact up by ID, provided in the path to the controller, and look that contact up.
We then render it as JSON. You have to appreciate the simplicity of this code!
Next, let's add updating and deleting a contact as well.
=== Updating & Deleting Contacts
As with the create contact API end point, because there is no HTML UI to produce for them, we can reuse the
`/api/v1/contacts/<contact id>` path. We will use the `PUT` HTTP action for updating a contact and the `DELETE`
action for deleting one.
Our update code is going to look nearly identical to the create handler, except that, rather than creating a new contact,
we will look up the contact by ID and update its fields. In this sense we are just combining the code of the create
handler and the detail view handler.
.Updating A Contact With Our JSON API
[source, python]
----
@app.route("/api/v1/contacts/<contact_id>", methods=["PUT"]) <1>
def json_contacts_edit(contact_id):
c = Contact.find(contact_id) <2>
c.update(request.form['first_name'], request.form['last_name'], request.form['phone'], request.form['email']) <3>
if c.save(): <4>
return c.__dict__
else:
return {"errors": c.errors}, 400
----
<1> We handle `PUT` requests to the URL for a given contact
<2> Look the contact up via the id passed in through the path
<3> We update the contact's data from the values included in the request
<4> From here on the logic is identical to the `json_contacts_create()` handler
Once again, very regular and, thanks to the built-in functionality in Flask, simple to implement.
Let's look at deleting a contact now. This turns out to be even simpler: as with the update handler we are going to
look up the contact by id, and then, well, delete it. At that point we can return a simple JSON object indicating
success.
.Deleting A Contact With Our JSON API
[source, python]
----
@app.route("/api/v1/contacts/<contact_id>", methods=["DELETE"]) <1>
def json_contacts_delete(contact_id=0):
contact = Contact.find(contact_id)
contact.delete() <2>
return jsonify({"success": True}) <3>
----
<1> We handle `DELETE` requests to the URL for a given contact
<2> Look the contact up and invoke the `delete()` method on it
<3> Return a simple JSON object indicating that the contact was successfully deleted
And, with that, we have our simple little JSON Data API to live alongside our regular web application, nicely separated
out from the main web application, so it can evolve separately as needed.
=== Additional Data API Considerations
Now, we obviously have a lot more to do if we want to make this a production ready JSON API:
* We don't have any rate limiting, which is important for any publicly facing Data API to avoid abusive clients.
* Even more crucially, there is currently no authentication mechanism. (We don't have one for our web application either!)
* We currently don't support paging of our contact data.
* Lots of small issues that we aren't addressing, such as rendering a proper `404 (Not Found)` response if someone makes
a request with a contact id that doesn't exist.
A full discussion around all of these topics is beyond the scope of this book, but I'd like to focus in on one in
particular, authentication, in order to show the difference between our hypermedia and JSON API. In order to secure
our application we need to add _authentication_, some mechanism for determining who a request is coming from, and
also _authorization_, determining if they have the right to perform the request.
We will set authorization aside for now and consider only authentication.
==== Authentication in Web Applications
In the HTML web application world, authentication has traditionally been done via a login page that asks a user for
their username (often their email) and a password. This password is then checked against a database of (hashed)
passwords to establish that the user is who they say they are. If the password is correct, then a _session cookie_
is established, indicating who the user is. This cookie is then sent with every request that the user makes to
the web application, allowing the application to know which user is making a given request.
.HTTP Cookies
****
HTTP Cookies are kind of a strange feature of HTTP. In some ways they violate the goal of remaining stateless, a
major component of the REST-ful architecture: a server will often use a session cookie as an index into state kept
on the server "on the side", such as a cache of the last action performed by the user.
Nonetheless, cookies have proven extremely useful and so people tend not to complain about this aspect of them too much
(I'm not sure what our other options would be here!) An interesting example of pragmatism gone (relatively) right in
web development.
****
In comparison with the typical web application approach to authentication, a JSON API will typically use some sort of
_token based_ authentication: an authentication token will be established via a mechanism like OAuth, and that authentication
token will then be passed, often as an HTTP Header, with every request that a client makes.
At a high level this is similar to what happens in normal web application authentication: a token is established somehow
and then then token is part of every request. However, in practice, the mechanics tend to be wildly different:
* Cookies are part of the HTTP specification and can be easily _set_ by an HTTP Server
* JSON Authentication tokens, in contrast, often require elaborate exchange mechanics like OAuth to be established
These differing mechanics for establishing authentication are yet another good reason for splitting our JSON and hypermedia
APIs up.
=== The "Shape" of Our Two APIs
When we were building out our API, we noted that in many cases the JSON API didn't require as many end points as our
hypermedia API did: we didn't need a `/contacts/new` handler, for example, to provide a hypermedia representation for
creating contacts.
Another aspect of our hypermedia API to consider was the performance improvement we made: we pulled the total contact count
out to a separate end point and implemented the "Lazy Load" pattern, to improve the perceived performance of our
application.
Now, if we had both our hypermedia and JSON API sharing the same paths, would we want to publish this API as a JSON
end point as well?
Maybe, but maybe not. This was a pretty specific need for our web application, and, absent a request from a user of
our JSON API, it doesn't make sense to include it for JSON consumers.
And what if, by some miracle, the performance issues with `Contact.count()` that we were addressing with the Lazy Load
pattern goes away? Well, in our Hypermedia Drive Application we can simply revert to the old code and include the
count directly in the request to `/contacts`. We can remove the `contacts/count` end point and all the logic associated
with it. By the miracle of hypermedia, the system will continue to work just fine!
But what if we had tied our JSON API and hypermedia API together, and published `/contacts/count` as a supported end
point for our JSON API? In that case we couldn't simply remove the end point: a (non-hypermedia) client might be
relying on it!
Once again you can see the flexibility of the hypermedia approach and why separating your JSON API out from your
hypermedia API lets you take maximum advantage of that flexibility.
=== The Model View Controller (MVC) Paradigm
One thing you may have noticed about the handlers for our JSON API is that they are relatively simple and regular.
Most of the hard work of updating data and so forth is done within the contact model itself: the handlers act as simple
connectors that provide a go-between the HTTP requests and the model.
This is the ideal controller of the Model-View-Controller (MVC) paradigm that was so popular in the early web: a controller
should be "thin", with the model containing the majority of the logic in the system.
.The Model View Controller Pattern
****
The Model View Controller design pattern is a classic architectural pattern in software development, and was a major
influence in early web development. It is no longer emphasized as heavily, as web development has split into front-end
and back-end camps, but most web developers are still familiar with the idea.
Traditionally, the MVC pattern mapped into web development like so:
* Model - A collection of "domain" classes that implement all the logic and rules for the particular domain your application
is designed for. The model typically provides "resources" that are then presented to clients as HTML "representations".
* View - Typically views would be some sort of client-side templating system, and would render the aforementioned HTML representation
for a given Model instance.
* Controller - The controllers job is to take HTTP requests, convert them into sensible requests to the Model and forward
those requests on to the appropriate Model objects. It then passes the HTML representation back to the client as an
HTTP response.
****
Thin controllers make it easy to split your JSON and hypermedia APIs out, because all the important logic lives in the domain
model that is shared by both. This allows you to evolve both separately, while still keeping logic in sync with one
another. With properly built "thin" controllers and "fat" models, keeping two separate APIs both in sync and yet
still evolving separately is not as difficult or as crazy as it might sound at first.
== Summary
* In this chapter we saw that a Hypermedia Driven Application can have a JSON Data API as well
* Hypermedia APIs and JSON Data APIs have different needs and shapes
* By properly factoring your back end code, much of the logic can be shared between the two APIs
* By splitting your APIs into both a Hypermedia API and a JSON Data API, you can evolve both without interferring with
one another, allowing you to, for example, change your Hypermedia API dramatically without breaking your JSON API
clients