Railway•4mo ago

Node Memory and CPU Usage is low

I have a node instance that I'm running with tsx / node commands. I notice the job is slower when run on Railway than when on my local machine. Yet, the Railway memory and CPU usage are very low. How can I have the job use more CPU and memory on Railway?

10 Replies

Percy•4mo ago

Project ID: 98f66160-ce86-4b15-a73b-cab912256208

themkrage•4mo ago

98f66160-ce86-4b15-a73b-cab912256208

Brody•4mo ago

how low are we talking? what is the job execution time difference between hosted and local? what does this job do? please provide more information

themkrage•4mo ago

low, like 4-6 gb memory, and 1-2 vCPU on Railway the job never completes! I let it run for 3-4 straight days. Locally it completes in 12 hours The job makes a large amount of database calls, then parses location data. The database is not the blocker though, its the local compute that checks various @turf / geojson functions

Joseph U.•4mo ago

If you have a job that is running for 12 hours on a local machine then it's probably the sort of thing you may want to batch so its not an all or nothing job where you will lose work if the server resets on you or is disrupted for any other reason. If the batch chunks are small enough and you can monitor progress you can see exactly how much of a performance penalty you have for running in the cloud vs. on bare metal.

themkrage•4mo ago

Thanks for the idea! Is there some way to run like X workers per batch on Railway on different instances or something? It seems one railway instance is not enough to run this job any update @Brody

Brody•4mo ago

while its quite possible you have faster hardware locally, there is not enough evidence here to support the fact that this could outright be an issue with the platform. i would take joseph's and my advice into account, batched jobs and verbose debug logging.

themkrage•4mo ago

How can I do this with Railway? the batch will stills be run on the same railway hardware no? Can replica's handle this/

Brody•4mo ago

its something that you would need to implement in code, not really anything to do with the platform itself

ThallesComH•4mo ago

if you need to split your job into replicas, consider using a queue mechanism like bullmq. but tbh I would look more in detail on why its taking that long, 3 days difference is weird, is your database and worker on private networking? If it was me, I would start by adding Axiom and ploting a graph on db calls, locations parse, everything that takes time and see the diff between local and railway

Gaming

Programming

Node Memory and CPU Usage is low