Skip to content

Extremely fast file change detector and task orchestrator for Node.js.

License

Notifications You must be signed in to change notification settings

gajus/turbowatch

Repository files navigation

Turbowatch 🏎

Extremely fast file change detector and task orchestrator for Node.js.

If you ever wanted something likeNodemonbut more capable, then you are at the right place.

Basic usage:

npm install turbowatch
cat>turbowatch.ts<<'EOD'
import { defineConfig } from 'turbowatch';

export default defineConfig({
project: __dirname,
triggers: [
{
expression: ['match', '*.ts', 'basename'],
name: 'build',
onChange: async ({ spawn }) => {
await spawn`tsc`;
},
},
],
});
EOD
npmexecturbowatch./turbowatch.ts

NoteSeelogginginstructions to print logs that explain what Turbowatch is doing.

Refer to recipes:

Turbowatch Nodemon
Node.js interface (scriptable) 1
Graceful termination (teardown) 2
Scriptable child processes (zx)
Retries
Debounce
Interruptible workflows
Concurrent workflows
Log grouping
Bring-your-own backend
Works with long-running processes
Works with build utilities and REPLs
Watch specific files or directories
Ignoring specific files or directories
Open source and available

1Undocumented
2Nodemon only provides the ability tosend a custom signalto the worker.

API

NotedefineConfigis used to export configuration for the consumption byturbowatchprogram. If you want to run Turbowatch programmatically, then usewatch.The API of both methods is equivalent.

Turbowatchdefaultsare a good choice for most projects. However, Turbowatch hasmanyoptions that you should be familiar with for advance use cases.

import{
watch,
typeChangeEvent,
}from'turbowatch';

voidwatch({
// Debounces triggers by 1 second.
// Most multi-file spanning changes are non-atomic. Therefore, it is typically desirable to
// batch together information about multiple file changes that happened in short succession.
// Provide { debounce: { wait: 0 } } to disable debounce.
debounce:{
wait:1000,
},
// The base directory under which all files are matched.
// Note: This is different from the "root project" (https://github /gajus/turbowatch#project-root).
project:__dirname,
triggers:[
{
// Expression match files based on name.
// https://github /gajus/turbowatch#expressions
expression:[
'allof',
['not',['dirname','node_modules']],
[
'anyof',
['match','*.ts','basename'],
['match','*.tsx','basename'],
]
],
// Indicates whether the onChange routine should be triggered on script startup.
// Defaults to false. Set it to false if you would like onChange routine to not run until the first changes are detected.
initialRun:true,
// Determines what to do if a new file change is detected while the trigger is executing.
// If {interruptible: true}, then AbortSignal will abort the current onChange routine.
// If {interruptible: false}, then Turbowatch will wait until the onChange routine completes.
// Defaults to true.
interruptible:false,
// Name of the trigger. Used for debugging
// Must match /^[a-z0-9-_]+$/ pattern and must be unique.
name:'build',
// Routine that is executed when file changes are detected.
onChange:async({spawn}:ChangeEvent)=>{
awaitspawn`tsc`;
awaitspawn`tsc-alias`;
},
// Routine that is executed when shutdown signal is received.
onTeardown:async({spawn})=>{
awaitspawn`rm -fr./dist`;
},
// Label a task as persistent if it is a long-running process, such as a dev server or --watch mode.
persistent:false,
// Retry a task if it fails. Otherwise, watch program will throw an error if trigger fails.
// Defaults to { retries: 3 }
retry:{
retries:3,
},
},
],
});

Motivation

To abstract the complexity of orchestrating file watching operations.

For context, we are usingTurborepo.The reason this project came to be is because Turborepo does not have "watch" mode (issue#986).

At first, we attempted to use a combination oftsc --watch,concurrentlyand Nodemon, but started to run into things breaking left and right, e.g.

  • services restarting prematurely (before all the assets are built)
  • services failing to gracefully shutdown and then failing to start, e.g. because ports are in use

Furthermore, the setup for each workspace was repetitive and not straightforward, and debugging issues was not a great experience because you have many workspaces running in watch mode producing tons of logs. Many of the workspaces being dependencies of each other, this kept re-triggering watch operations causing the mentioned issues.

In short, it quickly became clear that we need the ability to have more control over the orchestration of what/when needs to happen when files change.

We started with a script. At first I addeddebounce.That improved things. Then I addedgraceful terminationlogic, which mostly made everything work. We still had occasional failures due to out-of-order events, but addingretrylogic fixed that too... At the end, while we got everything to work, it took a lot of effort and it still was a collection of hacky scripts that are hard to maintain and debug, and that's how Turbowatch came to be –

Turbowatch is a toolbox for orchestrating and debugging file watching operations based on everything we learned along the way.

NoteIf you are working on a very simple project, i.e. just one build step or just one watch operation, thenyou don't need Turbowatch.Turbowatch is designed for monorepos or otherwise complex workspaces where you have dozens or hundreds of build steps that depend on each other (e.g. building and re-building dependencies, building/starting/stopping Docker containers, populating data, sending notifications, etc).

We alsoshared these learningswith Turborepo team in hopes that it will help to design an embedded file watching experience.

Use Cases

Turbowatch can be used to automate any sort of operations that need to happen in response to files changing, e.g.,

  • You can run (and conditionally restart) long-running processes (like your Node.js application)
  • You can build assets (like TypeScript and Docker images)

spawn

Turbowatch exposesspawnfunction that is an instance ofzx.Use it to evaluate shell commands:

async({spawn}:ChangeEvent)=>{
awaitspawn`tsc`;
awaitspawn`tsc-alias`;
},

The reason Turbowatch abstractszxis to enable graceful termination of child-processes when triggers are configured to beinterruptible.

Persistent tasks

Your setup may include tasks that are not designed to exit, e.g.next dev(starts Next.js in development mode).

It is important that these tasks are marked aspersistentto distinguish them from tasks that run to completion as that changes how Turbowatch treats them.

Persistent Non-Persistent
IgnoreFileChangeEventif{ interruptible: false }

Expressions

Expressions are used to match files. The most basic expression ismatch– it evaluates as true if a glob pattern matches the file, e.g.

Match all files with*.tsextension:

['match','*.ts','basename']

Expressions can be combined usingallofandanyof,e.g.,

Match all files with*.tsor*.tsxextensions:

[
'anyof',
['match','*.ts','basename'],
['match','*.tsx','basename']
]

Finally,notevaluates as true if the sub-expression evaluated as false, i.e. inverts the sub-expression.

Match all files with*.tsextension, but excludeindex.ts:

[
'allof',
['match','*.ts','basename'],
[
'not',
['match','index.ts','basename']
]
]

This is the gist behind Turbowatch expressions. However, there are many more expressions. InspectExpressiontype for further guidance.

typeExpression=
// Evaluates as true if all of the grouped expressions also evaluated as true.
|['allof',...Expression[]]
// Evaluates as true if any of the grouped expressions also evaluated as true.
|['anyof',...Expression[]]
// Evaluates as true if a given file has a matching parent directory.
|['dirname'|'idirname',string]
// Evaluates as true if a glob matches against the basename of the file.
|['match'|'imatch',string,'basename'|'wholename']
// Evaluates as true if the sub-expression evaluated as false, i.e. inverts the sub-expression.
|['not',Expression];

NoteTurbowatch expressions are a subset ofWatchman expressions.Originally, Turbowatch was developed to leverage Watchman as a superior backend for watching a large number of files. However, along the way, we discovered that Watchman does not support symbolic links (issue#105). Unfortunately, that makes Watchman unsuitable for projects that utilize linked dependencies (which is the direction in which the ecosystem is moving for dependency management in monorepos). As such, Watchman was replaced with chokidar. We are hoping to provide Watchman as a backend in the future. Therefore, we made Turbowatch expressions syntax compatible with a subset of Watchman expressions.

NoteTurbowatch usesmicromatchfor glob matching. Please note that you should be using forward slash (/) to separate paths, even on Windows.

Recipes

Rebuilding assets when file changes are detected

import{watch}from'turbowatch';

voidwatch({
project:__dirname,
triggers:[
{
expression:[
'allof',
['not',['dirname','node_modules']],
['match','*.ts','basename'],
],
name:'build',
onChange:async({spawn})=>{
awaitspawn`tsc`;
awaitspawn`tsc-alias`;
},
},
],
});

Restarting server when file changes are detected

import{watch}from'turbowatch';

voidwatch({
project:__dirname,
triggers:[
{
expression:[
'allof',
['not',['dirname','node_modules']],
[
'anyof',
['match','*.ts','basename'],
['match','*.graphql','basename'],
]
],
// Because of this setting, Turbowatch will kill the processes that spawn starts
// when it detects changes when it detects a change.
interruptible:true,
name:'start-server',
onChange:async({spawn})=>{
awaitspawn`tsx./src/bin/wait.ts`;
awaitspawn`tsx./src/bin/server.ts`;
},
},
],
});

Watchingnode_modules

There is more than one way to watchnode_modules.However, through trial and error we found that the following set of rules work the best for a generalized solution.

import{watch}from'turbowatch';

voidwatch({
project:path.resolve(__dirname,'../..'),
triggers:[
{
expression:[
'anyof',
[
'allof',
['dirname','node_modules'],
['dirname','dist'],
['match','*','basename'],
],
[
'allof',
['not',['dirname','node_modules']],
['dirname','src'],
['match','*','basename'],
],
],
name:'build',
onChange:async({spawn})=>{
returnspawn`pnpm run build`;
},
},
],
});

This setup makes an assumption that your workspaces sources are insrcdirectory andbuildtask outputs todistdirectory.

Reusing expressions

This might be common sense, but since Turbowatch scripts are regular JavaScript scripts, you can (and should) abstract your expressions and routines.

How you do it is entirely up to you, e.g. You could abstract just expressions or you could go as far as abstracting the entiretrigger:

import{watch}from'turbowatch';
import{
buildTrigger,
}from'@/turbowatch';

voidwatch({
project:__dirname,
triggers:[
buildTrigger(),
],
});

Such abstraction helps to avoid errors that otherwise may occur due to duplicative code across workspaces.

Reducing unnecessary reloads

Something that is important to consider when orchestrating file watching triggers is how to avoid unnecessary reloads. Consider if this was your "build" script:

rm -fr dist&&tsc&&tsc-alias

and let's assume that you are using an expression such as this one to detect when dependencies are updated:

[
'allof',
['dirname','node_modules'],
['dirname','dist'],
['match','*'],
],

Running this script will produce at least 3 file change events:

  1. whenrm -fr distcompletes
  2. whentsccompletes
  3. whentsc-aliascompletes

What's even worse is that even if the output has not changed, you are still going to trigger file change events (becausedistget replaced).

To some degree,debouncesetting helps with this. However, it will only help if there is no more than 1 second (by default) inbetween every command.

One way to avoid this entirely is by using an intermediate directory to output files and swapping only the files that changed. Here is how we do it:

rm -fr.dist&&tsc --project tsconfig.build.json&&rsync -cr --delete.dist/./dist/&&rm -fr.dist

This "build" script will always produce at most 1 event, and won't produce any events if the outputs have not changed.

This is not specific to Turbowatch, but something worth considering as you are designing your build pipeline.

Retrying failing triggers

Retries are configured by passing aretryproperty to the trigger configuration.

/**
* @property factor The exponential factor to use. Default is 2.
* @property maxTimeout The maximum number of milliseconds between two retries. Default is 30,000.
* @property minTimeout The number of milliseconds before starting the first retry. Default is 1000.
* @property retries The maximum amount of times to retry the operation. Default is 3. Seting this to 1 means do it once, then retry it once.
*/
typeRetry={
factor?:number,
maxTimeout?:number,
minTimeout?:number,
retries?:number,
}

Gracefully terminating Turbowatch

NoteSIGINTis automatically handled if you are usingturbowatchexecutable to evaluate your Turbowatch script. This examples shows how to programmatically gracefully shutdown Turbowatch if you choose not to useturbowatchprogram to evaluate your watch scripts.

WarningUnfortunately, many tools do not allow processes to gracefully terminate. There are open support issues for this in npm (#4603), pnpm (#2653) and yarn (#4667), but they haven't been addressed. Therefore, do not wrap yourturbowatchscript execution using these tools if you require processes to gracefully terminate.

watchreturns an instance ofTurbowatchController,which can be used to gracefully terminate the script:

const{shutdown}=awaitwatch({
project:__dirname,
triggers:[
{
name:'test',
expression:['match','*','basename'],
onChange:async({spawn})=>{
// `sleep 60` will receive `SIGTERM` as soon as `shutdown()` is called.
awaitspawn`sleep 60`;
},
}
],
});

// SIGINT is the signal sent when we press Ctrl+C
process.once('SIGINT',()=>{
voidshutdown();
});

Invokingshutdownwill propagate an abort signal to allonChangehandlers. The processes that were initiated usingspawnwill receiveSIGTERMsignal.

Gracefully terminating Turbowatch using anAbortController

In addition to being to Turbowatch using theshutdownroutine, Turbowatch instance can be shutdown using anAbortController.The main difference is thatshutdowncan be awaited to know when the shutdown routine has run to completion.

constabortController=newAbortController();

voidwatch({
abortController,
project:__dirname,
triggers:[
{
name:'test',
expression:['match','*','basename'],
onChange:async({spawn})=>{
// `sleep 60` will receive `SIGTERM` as soon as `shutdown()` is called.
awaitspawn`sleep 60`;
},
}
],
});

voidabortController.abort();

Handling theAbortSignal

Workflow might be interrupted in two scenarios:

  • when Turbowatch is being gracefully shutdown
  • when routine is marked asinterruptibleand a new file change is detected

Implementing interruptible workflows requires that you defineAbortSignalhandler. If you are usingzx,such abstraction could look like so:

NoteTurbowatch already comes withzxbound to theAbortSignal.Just usespawn.Documentation demonstrates how to implement equivalent functionality.

import{typeProcessPromise}from'zx';

constinterrupt=async(
processPromise:ProcessPromise,
abortSignal:AbortSignal,
)=>{
letaborted=false;

constkill=()=>{
aborted=true;

processPromise.kill();
};

abortSignal.addEventListener('abort',kill,{once:true});

try{
awaitprocessPromise;
}catch(error){
if(!aborted){
console.log(error);
}
}

abortSignal.removeEventListener('abort',kill);
};

which you can then use to kill your scripts, e.g.

exportdefaultwatch({
project:__dirname,
triggers:[
{
expression:['match','*.ts','basename'],
interruptible:false,
name:'sleep',
onChange:async({abortSignal})=>{
awaitinterrupt($`sleep 30`,abortSignal);
},
},
],
});

Tearing down project

onTeardownis going to be called when Turbowatch is gracefully terminated. Use it to "clean up" the project if necessary.

WarningThere is no timeout foronTeardown.

import{watch}from'turbowatch';

exportdefaultwatch({
abortController,
project:__dirname,
triggers:[
{
expression:['match','*.ts','basename'],
name:'build',
onChange:async({spawn})=>{
awaitspawn`tsc`;
},
onTeardown:async()=>{
awaitspawn`rm -fr./dist`;
},
},
],
});

Throttlingspawnoutput

When multiple processes are sending logs in parallel, the log stream might be hard to read, e.g.

redis:dev:973191cf >#5 sha256:7f65636102fd1f499092cb075baa95784488c0bbc3e0abff2a6d853109e4a948 4.19MB / 9.60MB 22.3s
api:dev:a1e4c6a7 > [18:48:37.171] 765ms debug @utilities#waitFor: Waiting for database to be ready...
redis:dev:973191cf >#5 sha256:d01ec855d06e16385fb33f299d9cc6eb303ea04378d0eea3a75d74e26c6e6bb9 0B / 1.39MB 22.7s
api:dev:a1e4c6a7 > [18:48:37.225] 54ms debug @utilities#waitFor: Waiting for Redis to be ready...
worker:dev:2fb02d72 > [18:48:37.313] 88ms debug @utilities#waitFor: Waiting for database to be ready...
redis:dev:973191cf >#5 sha256:7f65636102fd1f499092cb075baa95784488c0bbc3e0abff2a6d853109e4a948 5.24MB / 9.60MB 22.9s
worker:dev:2fb02d72 > [18:48:37.408] 95ms debug @utilities#waitFor: Waiting for Redis to be ready...
redis:dev:973191cf >#5 sha256:7f65636102fd1f499092cb075baa95784488c0bbc3e0abff2a6d853109e4a948 6.29MB / 9.60MB 23.7s
api:dev:a1e4c6a7 > [18:48:38.172] 764ms debug @utilities#waitFor: Waiting for database to be ready...
api:dev:a1e4c6a7 > [18:48:38.227] 55ms debug @utilities#waitFor: Waiting for Redis to be ready...

In this example,redis,apiandworkerprocesses produce logs at almost the exact same time causing the log stream to switch between outputting from a different process every other line. This makes it hard to read the logs.

By default, Turbowatch throttles log output to at most once a second per task, producing a lot more easier to follow log output:

redis:dev:973191cf >#5 sha256:7f65636102fd1f499092cb075baa95784488c0bbc3e0abff2a6d853109e4a948 4.19MB / 9.60MB 22.3s
redis:dev:973191cf >#5 sha256:d01ec855d06e16385fb33f299d9cc6eb303ea04378d0eea3a75d74e26c6e6bb9 0B / 1.39MB 22.7s
redis:dev:973191cf >#5 sha256:7f65636102fd1f499092cb075baa95784488c0bbc3e0abff2a6d853109e4a948 5.24MB / 9.60MB 22.9s
redis:dev:973191cf >#5 sha256:7f65636102fd1f499092cb075baa95784488c0bbc3e0abff2a6d853109e4a948 6.29MB / 9.60MB 23.7s
api:dev:a1e4c6a7 > [18:48:37.171] 765ms debug @utilities#waitFor: Waiting for database to be ready...
api:dev:a1e4c6a7 > [18:48:37.225] 54ms debug @utilities#waitFor: Waiting for Redis to be ready...
api:dev:a1e4c6a7 > [18:48:38.172] 764ms debug @utilities#waitFor: Waiting for database to be ready...
api:dev:a1e4c6a7 > [18:48:38.227] 55ms debug @utilities#waitFor: Waiting for Redis to be ready...
worker:dev:2fb02d72 > [18:48:37.313] 88ms debug @utilities#waitFor: Waiting for database to be ready...
worker:dev:2fb02d72 > [18:48:37.408] 95ms debug @utilities#waitFor: Waiting for Redis to be ready...

However, this means that some logs might come out of order. To disable this feature, set{ throttleOutput: { delay: 0 } }.

Watching multiple scripts

By default,turbowatchwill look forturbowatch.tsscript in the current working directory. However, you can pass multiple scripts toturbowatchto run them concurrently:

turbowatch./foo.ts./bar.ts

You can also provide a glob pattern:

turbowatch'**/turbowatch.ts'

Using custom file watching backend

Many of the existing file watching solutions come with tradeoffs, e.g. Watchman does not track symbolic links (#105), chokidar is failing to register file changes (#1240),fs.watchbehavior is platform specific, etc. For this reason, Turbowatch provides several backends to choose from and allows to bring-your-own backend by implementingFileWatchingBackendinterface.

By default, Turbowatch usesfs.watchon MacOS (Node.js v19.1+) and fallsback tochokidaron other platforms.

import{
watch,
// Smart Watcher that detects the best available file-watching backend.
TurboWatcher,
// fs.watch based file watcher.
FSWatcher,
// Chokidar based file watcher.
ChokidarWatcher,
// Interface that all file watchers must implement.
FileWatchingBackend,
}from'turbowatch';

exportdefaultwatch({
Watcher:TurboWatcher,
project:__dirname,
triggers:[],
});

Logging

Turbowatch usesRoarrlogger.

ExportROARR_LOG=trueenvironment variable to enable log printing tostdout.

Use@roarr/clito pretty-print logs.

ROARR_LOG=true turbowatch|roarr

Experiments

These are features that are available behind feature flags (TURBOWATCH_EXPERIMENTAL_*).

They are released to gather community feedback and may change at any point in future.

NoteThere are no active experiments at the moment.

Alternatives

The biggest benefit of using Turbowatch is that it provides a single abstraction for all file watching operations. That is, you might get away with Nodemon, concurrently,--watch,etc. running in parallel, but using Turbowatch will introduce consistency to how you perform watch operations.

Why not useX --watch?

Many tools provide built-in watch functionality, e.g.tsc --watch.However, there are couple of problems with relying on them:

  • Running many file watchers is inefficient and is probably draining your laptop's battery faster than you realize. Turbowatch uses a single server to watch all file changes.
  • Native tools do not allow to combine operations, e.g. If your build depends ontsc --watchandtsc-alias --watch,then you cannot combine them. On the other hand, Turbowatch allows you to chain arbitrary operations.

NoteTurbowatch is not a replacement for services that implement Hot Module Replacement (HMR), e.g. Next.js. However, you should still wrap those operations in Turbowatch for consistency, e.g.

voidwatch({
project:__dirname,
triggers:[
{
expression:['dirname',__dirname],
// Marking this routine as non-interruptible will ensure that
// next dev is not restarted when file changes are detected.
interruptible:false,
name:'start-server',
onChange:async({spawn})=>{
awaitspawn`next dev`;
},
// Enabling this option modifies what Turbowatch logs and warns
// you if your configuration is incompatible with persistent tasks.
persistent:true,
},
],
});

Why not concurrently?

I haveseenconcurrentlyused to "chain" watch operations such as:

concurrently"tsc -w""tsc-alias -w"

While this might work by brute-force, it will produce unexpected results as the order of execution is not guaranteed.

If you are using Turbowatch, simply execute one command after the other in the trigger workflow, e.g.

async({spawn}:ChangeEvent)=>{
awaitspawn`tsc`;
awaitspawn`tsc-alias`;
},

Why not Turborepo?

Turborepocurrently does not have support for watch mode (issue#986). However, Turbowatch has been designed to work with Turborepo.

To use Turbowatch with Turborepo:

  1. define a persistent task
  2. run the persistent task using--parallel

Example:

"dev":{
"cache":false,
"persistent":true
},
turbo run dev --parallel

NoteWe found that usingdependsOnwith Turbowatch produces undesirable effects. Instead, simply use Turbowatch expressions to identify when dependencies update.

NoteTurbowatch is not aware of the Turborepo dependency graph. Meaning, that your builds might fail at the first attempt. However, if you setup Turbowatch towatchnode_modules,then Turbowatch will automatically retry failing builds as soon as the dependencies are built.