Transport temporary data between jobs using caching
Pipelines should assume that the cache can be lost at any time
The cache keyword defines cache pushes and pulls
One job downloads or generates data and pushes to the cache
Subsequent jobs can pull from the cache to hit the ground running
Caching requires an object store…
… with an S3-compatible API
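The push and pull pattern above can be sketched as a pipeline configuration. This is a minimal sketch that assumes GitLab CI/CD syntax (a .gitlab-ci.yml file), which the slide does not name explicitly. The job names, the data/ directory, and the download-data.sh and run-tests.sh scripts are hypothetical placeholders, not part of the original material.

```yaml
# Illustrative sketch only: job names, paths, and scripts are assumptions.
stages:
  - prepare
  - test

fetch-data:
  stage: prepare
  script:
    - mkdir -p data
    - ./download-data.sh data/   # hypothetical script that downloads or generates the data
  cache:
    key: dataset-$CI_COMMIT_REF_SLUG
    paths:
      - data/
    policy: push                 # upload the cache at the end of the job, skip the download

run-tests:
  stage: test
  script:
    # The cache may be missing, so regenerate the data on a cache miss.
    - test -d data || ./download-data.sh data/
    - ./run-tests.sh data/       # hypothetical test entry point
  cache:
    key: dataset-$CI_COMMIT_REF_SLUG
    paths:
      - data/
    policy: pull                 # download the cache at the start of the job, skip the upload
```

Both jobs share the same cache key so that the second job finds what the first one pushed. The fallback in run-tests keeps the pipeline working when the cache has been lost.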