This may make some people pull their hair out, but I’d love to hear some arguments. I’ve had the impression that people really don’t like bash, not from here, but just from people I’ve worked with.

There was a task at work where we wanted something that’ll run on a regular basis, and doesn’t do anything complex aside from reading from the database and sending the output to some web API. Pretty common these days.

I can’t think of a simpler scripting language to use than bash. Here are my reasons:

  • Reading from the environment is easy, and so is falling back to some value; just do ${VAR:-fallback}; no need to write another if-statement to check for nullity. Wanna check if a variable’s set to something expected? if [[ <test goes here> ]]; then <handle>; fi
  • Reading from arguments is also straightforward; instead of import sys; sys.argv[1] in Python, you just do $1.
  • Sending a file via HTTP as part of an application/x-www-form-urlencoded request is super easy with curl. In most programming languages, you’d have to manually open the file and read it into bytes before putting it into your request for the HTTP library that you need to import. curl already does all that.
  • Need to read from a curl response and it’s JSON? Reach for jq.
  • Instead of having to set up a connection object/instance to your database, give sqlite, psql, duckdb or whichever cli db client a connection string with your query and be on your way.
  • Shipping is… fairly easy? Especially if docker is common in your infrastructure. Pull Ubuntu or debian or alpine, install your dependencies through the package manager, and you’re good to go. If you stay within Linux and don’t have to deal with differences in bash and core utilities between different OSes (looking at you macOS), and assuming you tried not to do anything too crazy and brought in the necessary dependencies in the form of calling them, it should be fairly portable.
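To make the list above concrete, here is a minimal sketch of that kind of job, not the actual script from work. DATABASE_URL, API_URL, the events table, the payload form field and the .status response key are all made-up names for illustration, and it assumes psql 12+ (for --csv), curl and jq are installed.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Assumed names: DATABASE_URL, API_URL, the "events" table, the "payload"
# form field and the ".status" response key are all hypothetical.
db_url="${DATABASE_URL:-postgresql://localhost/app}"   # env var with fallback
api_url="${API_URL:-http://localhost:8080/ingest}"

run_job() {
  local since="${1:-1970-01-01}"   # first argument, with a fallback
  local out
  out="$(mktemp)"

  # Feed the query on stdin: psql interpolates :'since' there, but not in -c.
  # ON_ERROR_STOP=1 makes psql exit non-zero when the statement fails.
  psql "$db_url" -X --csv -v ON_ERROR_STOP=1 -v since="$since" > "$out" <<'SQL'
SELECT id, name FROM events WHERE created_at >= :'since';
SQL

  # URL-encode the file contents into a form field and POST it.
  # --fail turns HTTP errors (>= 400) into a non-zero exit status.
  curl --fail --silent --show-error \
       --data-urlencode "payload@${out}" "$api_url" \
    | jq -r '.status'

  # Not reached if a step above fails under set -e; a trap would cover that.
  rm -f "$out"
}
```

Calling run_job "$@" at the bottom of the script would wire it up to the command line, so ./job.sh 2025-01-01 only sends rows created since that date.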

Sure, there can be security vulnerability concerns, but you’d still have to deal with the same problems with your Pythons, your Rubies, etc.

For most bash gotchas, shellcheck does a great job at warning you about them, and telling you how to address them.

There are probably a bunch of other considerations that I can’t think of off the top of my head, though I’ve addressed a bunch before.

So what’s the dealeo? What am I missing that may not actually be addressable?

[-] MajorHavoc@programming.dev 1 points 2 days ago

Sure.

I'll pick on postgres because it's popular. But I have found that most databases have a similar number of error codes.

https://www.postgresql.org/docs/current/errcodes-appendix.html

It's not a specific error that's the issue, it's the sheer variety of ways things can go wrong, combined with bash not having been architected with the database access use case in mind.

[-] Badland9085@lemm.ee 1 points 2 days ago

I find this argument somewhat weak. You are not going to run into the vast majority of those errors (in fact, some of them are not even errors, and you will probably never run into some of those errors as Postgres will not return them, eg some error codes from the sql standard). Many of them will only trigger if you do specific things: if you start a transaction, you’ll have to handle the possible errors that come with having a transaction.

There are lots of reasons to never use bash to connect to a db to do things. Here are a couple that I think are fairly basic, and that some may think they can just do in bash.

  • Write to more than 1 table.
  • Write to a table that has triggers, knowing that you may get a trigger failure.
  • Use transactions.
  • Call a stored procedure that will raise exceptions.
  • Accept user input and write it into a table.

One case where I think it’s fine to use bash and connect to a db is when all you need to do is a SELECT. You can test your statement in your db manager of choice, and bring that into bash. If you need input sanitization to filter results, stop, and use a language with a proper library. Otherwise, all the failure cases I can think of are: a) the connection fails for whatever reason, in which case you don’t get your data, you get a non-zero exit code, log to stderr, move on, b) your query fails because of bad sql, in which case, well, go back to your dev loop, no?
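A rough sketch of what handling those two failure cases could look like. The helper names describe_psql_status and run_select are made up; the mapping follows the exit statuses the psql documentation lists (1 for a fatal error of its own, 2 for a bad connection in a non-interactive session, 3 for a script error with ON_ERROR_STOP set), and other db clients will have their own codes.

```shell
#!/usr/bin/env bash

# Hypothetical helper: turn psql's documented exit statuses into a log message.
describe_psql_status() {
  case "$1" in
    0) echo "ok" ;;
    1) echo "psql fatal error" ;;
    2) echo "connection failed" ;;
    3) echo "query failed" ;;   # only reported when ON_ERROR_STOP is set
    *) echo "unknown" ;;
  esac
}

# Hypothetical wrapper: run the given command, log to stderr on failure,
# and hand the original exit status back to the caller.
run_select() {
  local status=0
  "$@" || status=$?
  if [ "$status" -ne 0 ]; then
    echo "select failed: $(describe_psql_status "$status") (exit $status)" >&2
  fi
  return "$status"
}
```

Usage would be something like run_select psql "$DATABASE_URL" -X -v ON_ERROR_STOP=1 -f query.sql > rows.csv || exit 1, where DATABASE_URL and query.sql are placeholders.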

This is why I asked what sort of problems you have run into before, assuming you haven’t been doing risky things with the connection. I’m sorry, but I must say that I’m fairly disappointed by your reply.

[-] MajorHavoc@programming.dev 1 points 2 days ago* (last edited 2 days ago)

> This is why I asked what sort of problems you have run into before,

Lol. I'm fucking old. I don't remember details.

> assuming you haven’t been doing risky things with the connection.

Ha! Not a safe assumption, though. I've maintained even more shitty code than I've written, and that's a lot! Lol.

[-] MajorHavoc@programming.dev 1 points 2 days ago* (last edited 2 days ago)

> I find this argument somewhat weak.

Lol. Me too. I was just trying to give the shorthand version.

Your explanation is much better.

Edit: but it doesn't sound like you really needed a detailed answer from me, anyway.

[-] Badland9085@lemm.ee 2 points 1 day ago

I actually love listening to or reading someone else’s war story, and tbh the entire purpose of this post is to dig those up. Bash is one of those places where a lot about it is passed around as tribal knowledge. So I’d really love to hear how things have failed.

[-] MajorHavoc@programming.dev 1 points 1 day ago

Fair enough.

Here's what I remember: invoking SQL containing inserts from bash has resulted in lost data, when fairly unsurprising database things happened, since bash didn't really expect to be in charge of logging the details of the attempted change. For the error, it wasn't something surprising - maybe it was "max connections reached", stuff that will just happen sometimes.

The data loss was probably solvable in bash, but the scripter didn't think to (and probably would have needed more effort in a full development tool).

[-] Badland9085@lemm.ee 1 points 1 day ago

Seems like something that can happen in any language, though yeah, bash doesn’t make it easier, and it’ll depend on what the cli tool would return given the error (eg does it return some code in stdout or stderr, or some non-zero exit code). Depending on the library (in the language of choice), you may still have to handle such errors manually, eg adding the necessary logic to retry.

And in such a case, I guess it would be prudent to either make sure that the data can be retrieved again, or push it somewhere a bit more permanent (shared fs, or object storage), sort of in a dead-letter-esque style. Seems like the lesson here is to have a fallback plan. The failure mode is not something a proper language and library would necessarily help discover more easily though.
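Something like this is what I have in mind for the retry plus dead-letter idea, as a sketch only. retry_or_park, MAX_ATTEMPTS and RETRY_DELAY are names I made up, and it assumes the write command reads its input from stdin.

```shell
#!/usr/bin/env bash

# Hypothetical helper: feed a payload file to a command on stdin, retry a few
# times, and if it still fails, copy the payload somewhere more permanent.
# Usage: retry_or_park <payload-file> <dead-letter-dir> <command> [args...]
retry_or_park() {
  local payload="$1" dead_dir="$2"
  shift 2
  local attempts="${MAX_ATTEMPTS:-3}" delay="${RETRY_DELAY:-2}"
  local try=1

  while [ "$try" -le "$attempts" ]; do
    if "$@" < "$payload"; then
      return 0
    fi
    echo "attempt ${try}/${attempts} failed: $*" >&2
    if [ "$try" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    try=$((try + 1))
  done

  # Out of retries: park the payload so the data is not lost with the job.
  mkdir -p "$dead_dir"
  cp "$payload" "$dead_dir/$(date +%Y%m%dT%H%M%S)-$$-$(basename "$payload")"
  echo "parked $payload in $dead_dir" >&2
  return 1
}
```

For example, retry_or_park inserts.sql /var/spool/myjob/dead psql "$DATABASE_URL" -X -v ON_ERROR_STOP=1, with the paths and DATABASE_URL being placeholders. The dead-letter directory could just as well be a shared fs mount. Retrying blindly is only safe if the statements are idempotent or wrapped in a single transaction.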

this post was submitted on 15 Jan 2025
82 points (97.7% liked)