4 Navigating the Unix file system
You can skip this section and proceed directly to the exercises if you are already familiar with the Unix directory structure and basic commands like cd
and ls
.
We have provided a list of helpful tips and hints in the appendix: Section A.1. Have a look already and refer back to it after you have worked your way through the next sections on navigation and basic commands.
Additionally, there is an overview of some of the most common symbols that are used by the Unix shell here: Section A.2.
4.1 Layout of the Unix file system
All files and directories (or folders) in Unix are stored in a hierarchical tree-like structure, similar to what you might be used to on Windows or Mac (cf. File Explorer). The base or foundation of the directory layout in Unix is the root (/
) (like the root of a tree). All other files and directories are built on top of this root location. When navigating the file system, it is also important to be aware of your current location. This is called the working directory.
The address of a particular file or directory is provided by its filepath: this is a sequence of location names separated by a forward slash (/
), like /home/user1
. Note that this differs from the convention in Windows, where backslashes (\
) are used in file paths instead.
There are two types of file paths: absolute and relative paths.
- Absolute file path: this is the exact location of a file and is always built up from the root location. E.g.,
/home/user1/projects/document.txt
. - Relative file path: this is the relative address of a file compared to some other path. E.g., from the perspective of
/home/user1
, the filedocument.txt
is located inprojects/document.txt
.
4.1.1 Home sweet home: ~
Another important location is the home directory. In general, every user has their own home directory, found in /home/username
. A frequently used shortcut for this is the tilde symbol (~
). Depending on the current user, this will refer to a particular directory under /home/.
.
user1
write the file path to document.txt
using the ~
shortcut?
~/projects/document.txt
4.1.2 Where am I? .
shortcuts
The dot (.
) also has an important function in file paths:
.
represents the directory you are currently in, i.e. the working directory.- E.g., while inside the
projects
directory, any files inside can be accessed using eitherfilename
or./filename
.
- E.g., while inside the
..
represents the parent directory of the working directory.- E.g., from
/home/user1/Desktop
, the relative path to filedocument.txt
can be written as../projects/document.txt
. - These expressions can be nested; while inside the
projects
directory,../../user2
can be used to access theuser2
home directory.
- E.g., from
4.2 Moving around the file system
In this section we will introduce a few essential commands that allow you to navigate the file system: pwd
, cd
and ls
.
4.2.1 pwd
: avoid getting lost
pwd
stands for print working directory and it does exactly that: it allows you to figure out where you are in the file system. For example, in the figure above, user1
would generally find themselves in their home directory upon login:
$ pwd
/home/user1
4.2.2 cd
: on the move
Next, there is the cd
command. This is used to move between directories (the name derives from change directory). Simply follow the command name by a file path to navigate there: cd <filepath>
. To move from user1
’s home directory to the projects
directory:
cd projects
Note that you can use the special symbols we saw earlier as navigational shortcuts:
Command | Result |
---|---|
cd ~ |
Change to home directory (/home/username ) |
cd .. |
Change to parent directory (e.g., go up 1 directory) |
cd / |
Change to the root location |
4.2.3 ls
: show me what you got
Finally, we have the ls
command. Its name stands for listing and it will list the names of the files and directories in the current working directory. The basic structure of the command ls [OPTIONS] <target>
, with <target>
being an optional path to a directory.
To continue upon our previous example, from inside /home/user1/projects
we would see:
$ ls
DRX333466_1.fastq.gz DRX333466_2.fastq.gz document.txt
Note that we did not specify a path, in which case ls
will just list the contents of the current working directory. If we do specify a path, we will of course be shown the contents of that particular location:
$ ls /home
user1 user2
By default, the files and directories are listed in alphabetically order and depending on your terminal settings, files and directories might even be colour-coded differently.
ls
also comes with a few handy optional flags to modify its behaviour:
Command | Result |
---|---|
ls -l |
Show detailed list view |
ls -hl |
Show detailed list view and print file sizes in a human readable format |
ls -a |
List all files and directories, including hidden ones |
ls -lha |
Combine all options into one command |
ls --help |
Show more information on the ls command and its options |
ls |
Earlier, we mentioned that .
is used to refer to the current working directory, but it actually has a second function as well. Any file or directory name that starts with a dot (like /home/user1/.ssh
) will be hidden and not displayed by default when using ls
, hence the need for the -a
flag.
Linux often hides system or configuration files to avoid cluttering up your (home) directory. We will not deal with hidden files directly in this course, but one of the situations where you might encounter them are when modifying your .bashrc
file (e.g., when creating custom functions, aliases or tweaking your PATH
Section A.4) or when managing SSH keys for remote server access Section A.6).
The ls -l
command is particularly useful, because it shows all types of additional information.
ls -l
represent?
$ ls -l
total 83764
-rw-r--r-- 1 pmoris pmoris 14367565 Dec 7 09:39 3B207-2_S92_L001_R1_001.fastq.gz
-rw-r--r-- 1 pmoris pmoris 16622378 Dec 7 09:39 3B207-2_S92_L001_R2_001.fastq.gz
-rw-r--r-- 1 pmoris pmoris 13592342 Dec 7 09:39 MRA1242_S28_L001_R1_001.fastq.gz
-rw-r--r-- 1 pmoris pmoris 15821981 Dec 7 09:39 MRA1242_S28_L001_R2_001.fastq.gz
-rw-r--r-- 1 pmoris pmoris 12131772 Dec 7 09:39 NK6_S57_L001_R1_001.fastq.gz
-rw-r--r-- 1 pmoris pmoris 13226198 Dec 7 09:39 NK6_S57_L001_R2_001.fastq.gz
The first column represents the permissions of the files/folders. In a nutshell, these determine things like who can read or write (= modify, including deletion) particular files. There is a column for the owner, a group of users and everyone else. There is more info in the appendix (Section A.5). The next column showing a 1
for each entry, you can ignore for now (they represent hard links, a concept we will not dive into). The two names in the following columns are the user and the group owner of the file. Next is the size of the file in bytes. If we had used the -h
flag, the size would have been shown in KB, MB or GB instead. Next we have the time of the last modification and finally the name of the file/directory.
4.3 Exercises
Note: In the exercises below, we use relative paths. These paths are all relative to the FA5-bioinformatics
directory. For example, during the FA5 course hosted at AHRI, the relative path ./training/data/reference
corresponds to the absolute path ~/Desktop/Fa/FA5-bioinformatics/training/data/reference
.
- Navigate to your home directory and list all the files and folders there. Try typing the path with and without using the
~
. Rely on tab-completion to assist you and avoid typos (Section A.1). - Print the name of the current working directory to your screen.
- List the contents of the
./training/data/fastq/
directory of the course files, without first moving there. Experiment with absolute and relative paths. - What is the most recent modification date of the file
Homo_sapiens.GRCh38.dna.chromosome.Y.truncated.fa
found in the./training/unix-demo/
directory? - Try to search for the file
penguins.csv
: what is the absolute path to it on your machine? - Navigate to
./training/data/fastq/
to make it your working directory (double check usingpwd
!). What is the relative path to thepenguins.csv
file from here? - Suppose your working directory is still
./training/data/fastq/
. What will the result ofpwd
be after running each of the following commands in succession?
cd ../
cd ../unix-demo/
cd files_to_loop_through/../../data/..
cd /
cd ~
4.4 Summary
- Absolute versus relative file paths
- Root (
/
) and home directory (~
) .
represents the current working directory..
represents the parent directorypwd
: print the path of the current working directorycd <path>
: navigate to the given directoryls <path>
: list files and directories in the given location- Hidden files contain a
.
at the start of their name and are not visible by default