Roadmap to becoming a developer in 2022
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 

5.2 KiB

Text Manipulation

Some of the most common commands you will use in the terminal are text manipulation commands. These commands allow you to search, replace, and manipulate text in files and streams. Here are some of the most common commands you will use:

  • awk - A programming language designed for text processing and typically used as a data extraction and reporting tool.
  • sed - A stream editor for filtering and transforming text.
  • grep - A command-line utility for searching plain-text data sets for lines that match a regular expression.
  • sort - A command-line utility for sorting lines of text files.
  • cut - A command-line utility for cutting sections from each line of files.
  • uniq - A command-line utility for reporting or omitting repeated lines.
  • cat - A command-line utility for concatenating files and printing on the standard output.
  • echo - A command-line utility for displaying a line of text.
  • fmt - A command-line utility for simple optimal text formatting.
  • tr - A command-line utility for translating or deleting characters.
  • nl - A command-line utility for numbering lines of files.
  • wc - A command-line utility for printing newline, word, and byte counts for files.

awk is a general-purpose scripting language used for manipulating data or text and generating reports in the Linux world. It is mostly used for pattern scanning and processing. It searches one or more files to see if they contain lines that match the specified patterns and then performs the associated actions.

sed(Stream Editor) command in UNIX can perform lots of functions on file like searching, finding and replacing, insertion or deletion. By using SED you can edit files even without opening them in editors like VI Editor.

The grep command (global search for regular expression and print out) searches file(s) for a particular pattern of characters, and displays all lines that contain that pattern. It can be used with other commands like ps making it more useful.

sort command is used to sort the contents of a file in a particular order. By default, it sorts a file assuming the contents are in ASCII. But it also can also be used to sort numerically by using appropriate options.

The cut utility cuts out selected portions of each line (as specified by list) from each file and writes them to the standard output.

The uniq utility reads the specified input_file comparing adjacent lines, and writes a copy of each unique input line to the output_file.

cat (concatenate) command is very frequently used in Linux. It reads data from the file and gives its content as output. It helps us to create, view, and concatenate files.

echo is a built-in command in Linux used to display lines of text/string that are passed as an argument. It is mostly used in shell scripts and batch files to output status text or ENV variables to the screen or a file.

fmt command is for formatting and optimizing contents in text files. It will be really useful when it comes to beautify large text files by setting uniform column width and spaces.

The tr utility copies the standard input to the standard output with substitution or deletion of selected characters.

The nl utility reads lines from the named file or the standard input if the file argument is omitted, applies a configurable line numbering filter operation and writes the result to the standard output.

The wc utility displays the number of lines, words, and bytes contained in each input file, or standard input (if no file is specified) to the standard output.