596 lines
13 KiB
Markdown
596 lines
13 KiB
Markdown
Bash Style Guide
|
||
================
|
||
|
||
This guide outlines how to write bash scripts with a style that makes them safe
|
||
and predictable. This guide is written by [Dave Eddy](https://daveeddy.com) as
|
||
part of the YSAP (You Suck at Programming) series [ysap.sh](https://ysap.sh) and
|
||
is the working document for how I approach bash scripting when it comes to
|
||
style, design, and best-practices.
|
||
|
||
Preface
|
||
-------
|
||
|
||
This guide will try to be as objective as possible, providing reasoning for why
|
||
certain decisions were made. For choices that are purely aesthetic (and may
|
||
not be universally agreeable) they will exist in the `Aesthetics` section
|
||
below.
|
||
|
||
Though good style alone won't ensure that your scripts are free from error, it
|
||
can certainly help narrow the scope for bugs to exist. This guide attempts to
|
||
explicitly state my style choices instead of implicitly relying on a sense or a
|
||
"vibe" of how code should be written.
|
||
|
||
Aesthetics
|
||
----------
|
||
|
||
### Tabs / Spaces
|
||
|
||
Tabs.
|
||
|
||
### Columns
|
||
|
||
Not to exceed 80.
|
||
|
||
### Semicolons
|
||
|
||
Avoid using semicolons in scripts unless required in control statements (e.g.,
|
||
if, while).
|
||
|
||
``` bash
|
||
# wrong
|
||
name='dave';
|
||
echo "hello $name";
|
||
|
||
# right
|
||
name='dave'
|
||
echo "hello $name"
|
||
```
|
||
|
||
The exception to this rule is outlined in the `Block Statements` section below.
|
||
Namely, semicolons should be used for control statements like `if` or `while`.
|
||
|
||
### Functions
|
||
|
||
Don't use the `function` keyword. All variables created in a function should
|
||
be made local.
|
||
|
||
``` bash
|
||
# wrong
|
||
function foo {
|
||
i=foo # this is now global, wrong depending on intent
|
||
}
|
||
|
||
# right
|
||
foo() {
|
||
local i=foo # this is local, preferred
|
||
}
|
||
```
|
||
|
||
### Block Statements
|
||
|
||
`then` should be on the same line as `if`, and `do` should be on the same line
|
||
as `while`.
|
||
|
||
``` bash
|
||
# wrong
|
||
if true
|
||
then
|
||
...
|
||
fi
|
||
|
||
# also wrong, though admittedly looks kinda cool
|
||
true && {
|
||
...
|
||
}
|
||
|
||
# right
|
||
if true; then
|
||
...
|
||
fi
|
||
```
|
||
|
||
### Spacing
|
||
|
||
No more than 2 consecutive newline characters (ie. no more than 1 blank line in
|
||
a row).
|
||
|
||
### Comments
|
||
|
||
No explicit style guide for comments. Don't change someones comments for
|
||
aesthetic reasons unless you are rewriting or updating them.
|
||
|
||
---
|
||
|
||
Bashisms
|
||
--------
|
||
|
||
This style guide is for bash. This means when given the choice, always prefer
|
||
bash builtins or keywords instead of external commands or `sh(1)` syntax.
|
||
|
||
### `test(1)`
|
||
|
||
Use `[[ ... ]]` for conditional testing, not `[ .. ]` or `test ...`
|
||
|
||
``` bash
|
||
# wrong
|
||
test -d /etc
|
||
|
||
# also wrong
|
||
[ -d /etc ]
|
||
|
||
# correct
|
||
[[ -d /etc ]]
|
||
```
|
||
|
||
- [YSAP066](https://ysap.sh/v/66)
|
||
See [BashFAQ031](http://mywiki.wooledge.org/BashFAQ/031) for more information
|
||
about these.
|
||
|
||
### Sequences
|
||
|
||
Use bash builtins for generating sequences
|
||
|
||
``` bash
|
||
n=10
|
||
|
||
# wrong
|
||
for f in $(seq 1 5); do
|
||
...
|
||
done
|
||
|
||
# wrong
|
||
for f in $(seq 1 "$n"); do
|
||
...
|
||
done
|
||
|
||
# right
|
||
for f in {1..5}; do
|
||
...
|
||
done
|
||
|
||
# right
|
||
for ((i = 0; i < n; i++)); do
|
||
...
|
||
done
|
||
```
|
||
|
||
- [YSAP052](https://ysap.sh/v/52/)
|
||
- [YSAP053](https://ysap.sh/v/53/)
|
||
|
||
### Command Substitution
|
||
|
||
Use `$(...)` for command substitution.
|
||
|
||
``` bash
|
||
foo=`date` # wrong
|
||
foo=$(date) # right
|
||
```
|
||
|
||
- [YSAP022](https://ysap.sh/v/22/)
|
||
|
||
### Math / Integer Manipulation
|
||
|
||
Use `((...))` and `$((...))`.
|
||
|
||
``` bash
|
||
a=5
|
||
b=4
|
||
|
||
# wrong
|
||
if [[ $a -gt $b ]]; then
|
||
...
|
||
fi
|
||
|
||
# right
|
||
if ((a > b)); then
|
||
...
|
||
fi
|
||
```
|
||
|
||
Do **not** use the `let` command.
|
||
|
||
### Parameter Expansion
|
||
|
||
Always prefer parameter expansion over external commands like `echo`, `sed`,
|
||
`awk`, etc.
|
||
|
||
``` bash
|
||
name='bahamas10'
|
||
|
||
# wrong
|
||
prog=$(basename "$0")
|
||
nonumbers=$(echo "$name" | sed -e 's/[0-9]//g')
|
||
|
||
# right
|
||
prog=${0##*/}
|
||
nonumbers=${name//[0-9]/}
|
||
```
|
||
|
||
- [YSAP026](https://ysap.sh/v/26/)
|
||
- [YSAP056](https://ysap.sh/v/56/)
|
||
|
||
### Listing Files
|
||
|
||
Do not [parse ls(1)](http://mywiki.wooledge.org/ParsingLs), instead use
|
||
bash builtin functions to loop files
|
||
|
||
``` bash
|
||
# very wrong, potentially unsafe
|
||
for f in $(ls); do
|
||
...
|
||
done
|
||
|
||
# right
|
||
for f in *; do
|
||
...
|
||
done
|
||
```
|
||
|
||
- [YSAP001](https://ysap.sh/v/1/)
|
||
|
||
### Determining path of the executable (`__dirname`)
|
||
|
||
Simply stated, you can't know this for sure. If you are trying to find out the
|
||
full path of the executing program, you should rethink your software design.
|
||
|
||
See [BashFAQ028](http://mywiki.wooledge.org/BashFAQ/028) for more information
|
||
|
||
For a case study on `__dirname` in multiple languages see my blog post
|
||
|
||
[Dirname Case
|
||
Study](http://daveeddy.com/2015/04/13/dirname-case-study-for-bash-and-node/)
|
||
|
||
### Arrays and lists
|
||
|
||
Use bash arrays instead of a string separated by spaces (or newlines, tabs,
|
||
etc.) whenever possible
|
||
|
||
``` bash
|
||
# wrong
|
||
modules='json httpserver jshint'
|
||
for module in $modules; do
|
||
npm install -g "$module"
|
||
done
|
||
|
||
# right
|
||
modules=(json httpserver jshint)
|
||
for module in "${modules[@]}"; do
|
||
npm install -g "$module"
|
||
done
|
||
```
|
||
|
||
Of course, in this example it may be better expressed as:
|
||
|
||
``` bash
|
||
npm install -g "${modules[@]}"
|
||
```
|
||
|
||
... only if the command supports multiple arguments and you are not interested
|
||
in catching individual failures.
|
||
|
||
- [YSAP020](https://ysap.sh/v/20)
|
||
- [Arrays explained in 7 minutes](https://www.youtube.com/watch?v=asHJ-xfuyno)
|
||
|
||
### read builtin
|
||
|
||
Use the bash `read` builtin whenever possible to avoid forking external
|
||
commands
|
||
|
||
Example
|
||
|
||
``` bash
|
||
fqdn='computer1.daveeddy.com'
|
||
|
||
IFS=. read -r hostname domain tld <<< "$fqdn"
|
||
echo "$hostname is in $domain.$tld"
|
||
# => "computer1 is in daveeddy.com"
|
||
```
|
||
|
||
---
|
||
|
||
External Commands
|
||
-----------------
|
||
|
||
### GNU userland tools
|
||
|
||
The whole world doesn't run on GNU or on Linux; avoid GNU specific options
|
||
when forking external commands like `awk`, `sed`, `grep`, etc. to be as
|
||
portable as possible.
|
||
|
||
When writing bash and using all the powerful tools and builtins bash gives you,
|
||
you'll find it rare that you need to fork external commands to do simple string
|
||
manipulation.
|
||
|
||
- [YSAP029](https://ysap.sh/v/29/)
|
||
|
||
### Useless Use of Cat Award
|
||
|
||
Don't use `cat(1)` when you don't need it. If programs support reading from
|
||
stdin, pass the data in using bash redirection.
|
||
|
||
``` bash
|
||
# wrong
|
||
cat file | grep foo
|
||
|
||
# right
|
||
grep foo < file
|
||
|
||
# also right
|
||
grep foo file
|
||
```
|
||
|
||
Prefer using a command line tools builtin method of reading a file instead of
|
||
passing in stdin. This is where we make the inference that, if a program says
|
||
it can read a file passed by name, it's probably more performant to do that.
|
||
|
||
- [Your Using `cat` Wrong](https://www.youtube.com/watch?v=vAK55aiRLeY)
|
||
- [UUOC](http://www.smallo.ruhr.de/award.html)
|
||
|
||
---
|
||
|
||
Style
|
||
-----
|
||
|
||
### Quoting
|
||
|
||
Use double quotes for strings that require variable expansion or command
|
||
substitution interpolation, and single quotes for all others.
|
||
|
||
``` bash
|
||
# right
|
||
foo='Hello World'
|
||
bar="You are $USER"
|
||
|
||
# wrong
|
||
foo="hello world"
|
||
|
||
# possibly wrong, depending on intent
|
||
bar='You are $USER'
|
||
```
|
||
|
||
All variables that will undergo word-splitting *must* be quoted (1). If no
|
||
splitting will happen, the variable may remain unquoted.
|
||
|
||
``` bash
|
||
foo='hello world'
|
||
|
||
if [[ -n $foo ]]; then # no quotes needed:
|
||
# [[ ... ]] won't word-split variable expansions
|
||
|
||
echo "$foo" # quotes needed
|
||
fi
|
||
|
||
bar=$foo # no quotes needed - variable assignment doesn't word-split
|
||
```
|
||
|
||
1. The only exception to this rule is if the code or bash controls the variable
|
||
for the duration of its lifetime. For example code like this:
|
||
|
||
``` bash
|
||
printf_date_supported=false
|
||
if printf '%()T' &>/dev/null; then
|
||
printf_date_supported=true
|
||
fi
|
||
|
||
if $printf_date_supported; then
|
||
...
|
||
fi
|
||
```
|
||
|
||
Even though `$printf_date_supported` undergoes word-splitting in the `if`
|
||
statement in that example, quotes are not used because the contents of that
|
||
variable are controlled explicitly by the programmer and not taken from a user
|
||
or command.
|
||
|
||
Also, variables like `$$`, `$?`, `$#`, etc. don't required quotes because they
|
||
will never contain spaces, tabs, or newlines.
|
||
|
||
When in doubt; [quote all expansions](http://mywiki.wooledge.org/Quotes).
|
||
|
||
- [YSAP021](https://ysap.sh/v/21/)
|
||
|
||
### Variable Declaration
|
||
|
||
Avoid uppercase variable names unless they 1. are constants or 2. are exported
|
||
to the environment using `export`. Don't use `let` or `readonly` to create
|
||
variables. `declare` should *only* be used for associative arrays. `local`
|
||
should *always* be used in functions.
|
||
|
||
``` bash
|
||
# wrong
|
||
declare -i foo=5
|
||
let foo++
|
||
readonly bar='something'
|
||
FOOBAR=baz
|
||
|
||
# right
|
||
i=5
|
||
((i++))
|
||
bar='something'
|
||
export FOOBAR=baz
|
||
```
|
||
|
||
### shebang
|
||
|
||
Bash is not always located at `/bin/bash`, so use this line:
|
||
|
||
``` bash
|
||
#!/usr/bin/env bash
|
||
```
|
||
|
||
Unless you’re intentionally targeting a specific environment (e.g. `/bin/bash`
|
||
on Linux servers with restricted PATHs).
|
||
|
||
- [Shebangs are Weird](https://www.youtube.com/watch?v=aoHMiCzqCNw)
|
||
|
||
### Error Checking
|
||
|
||
`cd`, for example, doesn't always work. Make sure to check for any possible
|
||
errors for `cd` (or commands like it) and exit or break if they are present.
|
||
|
||
``` bash
|
||
# wrong
|
||
cd /some/path # this could fail
|
||
rm file # if cd fails where am I? what am I deleting?
|
||
|
||
# right
|
||
cd /some/path || exit
|
||
rm file
|
||
```
|
||
|
||
### Using `set -e`
|
||
|
||
Don't set `errexit`. Like in C, sometimes you want an error, or you expect
|
||
something to fail, and that doesn't necessarily mean you want the program
|
||
to exit.
|
||
|
||
This is a controversial opinion that I have on the surface, but the link below
|
||
will show situations where `set -e` can do more harm than good because of its
|
||
implications.
|
||
|
||
- [The Problem with Bash "strict mode"](https://www.youtube.com/watch?v=4Jo3Ml53kvc)
|
||
- [BashFAQ105](http://mywiki.wooledge.org/BashFAQ/105)
|
||
|
||
### Using `eval`
|
||
|
||
Never.
|
||
|
||
It opens your code to code injection and makes static analysis impossible.
|
||
Almost every use-case can be solved more safely with arrays, indirect expansion,
|
||
or proper quoting.
|
||
|
||
---
|
||
|
||
Common Mistakes
|
||
---------------
|
||
|
||
### Using {} instead of quotes.
|
||
|
||
Using `${f}` is potentially different than `"$f"` because of how word-splitting
|
||
is performed. For example.
|
||
|
||
``` bash
|
||
for f in '1 space' '2 spaces' '3 spaces'; do
|
||
echo ${f}
|
||
done
|
||
```
|
||
|
||
yields:
|
||
|
||
```
|
||
1 space
|
||
2 spaces
|
||
3 spaces
|
||
```
|
||
|
||
Notice that it loses the amount of spaces. This is due to the fact that the
|
||
variable is expanded and undergoes word-splitting because it is unquoted. This
|
||
loop results in the 3 following commands being executed:
|
||
|
||
``` bash
|
||
echo 1 space
|
||
echo 2 spaces
|
||
echo 3 spaces
|
||
```
|
||
|
||
The extra spaces are effectively ignored here and only 2 arguments are passed
|
||
to the `echo` command in all 3 invocations.
|
||
|
||
If the variable was quoted instead:
|
||
|
||
``` bash
|
||
for f in '1 space' '2 spaces' '3 spaces'; do
|
||
echo "$f"
|
||
done
|
||
```
|
||
|
||
yields:
|
||
|
||
```
|
||
1 space
|
||
2 spaces
|
||
3 spaces
|
||
```
|
||
|
||
The variable `$f` is expanded but doesn't get split at all by bash, so it is
|
||
passed as a single string (with spaces) to the `echo` command in all 3
|
||
invocations.
|
||
|
||
Note that, for the most part `$f` is the same as `${f}` and `"$f"` is the same
|
||
as `"${f}"`. The curly braces should only be used to ensure the variable name
|
||
is expanded properly. For example:
|
||
|
||
``` bash
|
||
$ echo "$HOME is $USERs home directory"
|
||
/home/dave is home directory
|
||
$ echo "$HOME is ${USER}s home directory"
|
||
/home/dave is daves home directory
|
||
```
|
||
|
||
The braces in this example were the difference of `$USER` vs `$USERs` being
|
||
expanded.
|
||
|
||
### Abusing for-loops when while would work better
|
||
|
||
`for` loops are great for iteration over arguments, or arrays. Newline
|
||
separated data is best left to a `while read -r ...` loop.
|
||
|
||
``` bash
|
||
users=$(awk -F: '{print $1}' /etc/passwd)
|
||
for user in $users; do
|
||
echo "user is $user"
|
||
done
|
||
```
|
||
|
||
This example reads the entire `/etc/passwd` file to extract the usernames into
|
||
a variable separated by newlines. The `for` loop is then used to iterate over
|
||
each entry.
|
||
|
||
This approach has a lot of issues if used on other files with data that may
|
||
contain spaces or tabs.
|
||
|
||
1. This reads *all* usernames into memory, instead of processing them in a
|
||
streaming fashion.
|
||
2. If the first field of that file contained spaces or tabs, the for loop would
|
||
break on that as well as newlines.
|
||
3. This only works *because* `$users` is unquoted in the `for` loop - if
|
||
variable expansion only works for your purposes while unquoted this is a good
|
||
sign that something isn't implemented correctly.
|
||
|
||
To rewrite this:
|
||
|
||
``` bash
|
||
while IFS=: read -r user _; do
|
||
echo "$user is user"
|
||
done < /etc/passwd
|
||
```
|
||
|
||
This will read the file in a streaming fashion, not pulling it all into memory,
|
||
and will break on colons extracting the first field and discarding (storing as
|
||
the variable `_`) the rest - using nothing but bash builtin commands.
|
||
|
||
- [YSAP038](https://ysap.sh/v/38/)
|
||
|
||
---
|
||
|
||
References
|
||
----------
|
||
|
||
- [YSAP](https://ysap.sh)
|
||
- [BashGuide](https://mywiki.wooledge.org/BashGuide)
|
||
- [BashPitFalls](http://mywiki.wooledge.org/BashPitfalls)
|
||
- [Bash Practices](http://mywiki.wooledge.org/BashGuide/Practices)
|
||
|
||
Get This Guide
|
||
--------------
|
||
|
||
- `curl style.ysap.sh` - View this guide in your terminal.
|
||
- `curl style.ysap.sh/plain` - View this guide without color in your terminal.
|
||
- `curl style.ysap.sh/md` - Get the raw markdown.
|
||
- [Website](https://style.ysap.sh) - Dedicated website for this guide.
|
||
- [GitHub](https://github.com/bahamas10/bash-style-guide) - View the source.
|
||
|
||
License
|
||
-------
|
||
|
||
MIT License |