In this article, I will walk you through how to scale up your node.js application in the easiest way, that is the cluster to serve more requests.
Node.js runs on a single thread, it means by default, it runs on 1 CPU. If your application receives a lot of requests or it has a long live blocking function. Your entire application can be unresponsive although your server still has a lot of resources.
For example:
This simple Node.js application has 2 endpoints
javascriptconst express = require("express");const os = require("os");const app = express();const PORT = 3000;// Simple recursive Fibonacci (not optimized for large numbers)function fib(n) {if (n <= 1) return n;return fib(n - 1) + fib(n - 2);}// Endpoint to return the 45th Fibonacci numberapp.get("/fib", (req, res) => {const start = process.hrtime();const result = fib(45);const end = process.hrtime(start);const executionTime = (end[0] * 1000 + end[1] / 1000000).toFixed(2); // Convert to millisecondsres.json({fibonacciNumber: result,processingTime: `${executionTime}ms`,processId: process.pid,});});// Endpoint to return server infoapp.get("/info", (req, res) => {res.json({name: os.hostname(),numberOfCpus: os.cpus().length,processId: process.pid,});});// Start the serverapp.listen(PORT, () => {console.log(`Server is running on http://localhost:${PORT}`);});
Since the Get Fibonacci Number is the blocking function, and it is written poorly, whenever it is executed, it occupy the entire thread, making application is unresponsive to the requests. So if you place the second request to get server information, it returns the result only when the get fib number function is executed successfully
The application is not responding when you perform 2 requests at the same time, get fibbonaci number 45th, and get server information, even that the get info is very lightweight endpoint.
In the image below, demonstrate that for a server which has 10 Threads, one thread is 100%, while the other is just idle.
So it comes to the question, how can you use other other cpus to handle more requests, because is is obviously, there are 9 idles cpus, sitting and doing nothing, while one cpu is working ass off?
That is when cluster come for the rescue.
What is Node.js Cluster?
With a bit of refactoring, our code can become.
javascriptconst express = require("express");const os = require("os");const cluster = require("cluster");// Check if current process is primaryif (cluster.isPrimary) {const numCPUs = os.cpus().length;// Fork workers for each CPUfor (let i = 0; i < numCPUs; i++) {cluster.fork();}// Log when a worker dies and fork a new onecluster.on("exit", (worker, code, signal) => {console.log(`Worker ${worker.process.pid} died`);cluster.fork();});console.log(`Primary ${process.pid} is running`);} else {const app = express();const PORT = 3000;function fib(n) {if (n <= 1) return n;return fib(n - 1) + fib(n - 2);}app.get("/fib", (req, res) => {const start = process.hrtime();const result = fib(45);const end = process.hrtime(start);const executionTime = (end[0] * 1000 + end[1] / 1000000).toFixed(2);res.json({fibonacciNumber: result,processingTime: `${executionTime}ms`,processId: process.pid,});});// Endpoint to return server infoapp.get("/info", (req, res) => {res.json({name: os.hostname(),numberOfCpus: os.cpus().length,processId: process.pid,});});// Start the serverapp.listen(PORT, () => {console.log(`Worker ${process.pid} is running on http://localhost:${PORT}`);});}
As you can see, in this version, the code can run on multiple threads, with the number of threads equal to the number of cpus of the system.
so when you perform the
bashGET localhost:3000/fibGET localhost:3000/info
It can run at the time time, although the fib endpoint can make a CPU go to 100%, but the info still can be handled by other thread.
Each time you access the endpoint, you may noticed that the processId
will be different since it is handled by different threads
For the full code, you can checkout at:
https://github.com/mt26691/dalabs-sample-files/tree/main/nodejs-cluster-example