2.11. Supplemental: Bitwise Operators

Some exotic programming sub-disciplines like communications, embedded programming, and cryptography use bitwise operations extensively. Although they are used less frequently in "ordinary" programming, they are necessary for common tasks like file I/O operations, which the text covers in its last chapter. C++ implements bitwise operations with operators, making their inclusion in this chapter appropriate. However, the text doesn't rely on them significantly until it covers file I/O, so it labels them as "supplemental" until revisiting them later.

C++ inherits six bitwise operators from C. Bitwise operations are always available in assembly language but are less common in higher-level languages. Including these operators in C made it possible to write operating systems and device drivers in C rather than in assembly. Five of the operators are binary and require two integer operands; the sixth, complement, is unary. Three operators act on the corresponding bits of the two operands; we can summarize these three and the complement operator with truth tables. The remaining two operators treat one operand as a string of bits and shift them to the left or the right. We'll often view one or both operands as a short string of bits for convenience and ease of illustration.

Basic Bitwise Operators

The basic bitwise operators are simple enough to describe with truth tables. When we use these operators, it's convenient to view both operands in binary: 1s and 0s. Each 0 corresponds to false, and each 1 corresponds to true. A simple example follows each truth table, illustrating what it means for the bitwise operators to "operate on the corresponding bits of the two operands." Operating on two bits with a bitwise operator produces a single bit. The following figures detail

the symbol used for each operator,
the operator's name,
an exhaustive list of possible inputs with corresponding outputs,
a simple rule to summarize the behavior of the operator, and
a simple, 4-bit example. Although the bitwise operators can operate on integers of any size, the examples rely on 4-bit values for simplicity. Apply the operator column by column: the pair of bits in each column above the line are the operands, while the bit below the line is the result.

a	b	a & b
0	0	0
0	1	0
1	0	0
1	1	1

12 & 9 = 8

  1100
& 1001
------
  1000

Bitwise-AND. Both operands must be 1 to produce a 1. Bitwise-AND is used to switch off or mask out bits.

a	b	a | b
0	0	0
0	1	1
1	0	1
1	1	1

12 | 9 = 13

  1100
| 1001
------
  1101

Bitwise-OR. Both operands must be 0 to produce a 0. Bitwise-OR is used to switch on or set a bit to 1.

a	b	a ^ b
0	0	0
0	1	1
1	0	1
1	1	0

12 ^ 9 = 5

  1100
^ 1001
------
  0101

Bitwise XOR (Exclusive-OR). Operands must be different to produce a 1. XOR is reversible: if A^B=C, then A^C=B, and C^B=A.

a	~a
0	1
1	0

~12 = 3

~ 1100
------
  0011

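Bitwise Complement. The bitwise complement operator calculates the one's complement by toggling the 1s to 0s and the 0s to 1s. The result ~12 = 3 holds for the 4-bit example; on a wider integer, the complement also toggles every higher-order bit. Adding 1 to the one's complement produces the two's complement.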
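The four figures above can be reproduced with a few small functions. This is only a sketch: the helper names (`and4`, `or4`, `xor4`, `not4`, `neg4`) are hypothetical and not part of the text, and each result is masked with `0xF` so that it matches the 4-bit illustrations rather than the full width of an `unsigned`.

```cpp
// Hypothetical helpers for illustration only (not from the text).
// Each result is masked with 0xF to keep just the low 4 bits,
// matching the 4-bit examples in the figures.

unsigned and4(unsigned a, unsigned b) { return (a & b) & 0xFu; }  // 1 only where both bits are 1
unsigned or4(unsigned a, unsigned b)  { return (a | b) & 0xFu; }  // 0 only where both bits are 0
unsigned xor4(unsigned a, unsigned b) { return (a ^ b) & 0xFu; }  // 1 where the bits differ
unsigned not4(unsigned a)             { return ~a & 0xFu; }       // one's complement, 4 bits

// Two's complement in 4 bits: one's complement plus 1.
unsigned neg4(unsigned a)             { return (not4(a) + 1u) & 0xFu; }
```

With these definitions, `and4(12, 9)` is 8, `or4(12, 9)` is 13, `xor4(12, 9)` is 5, and `not4(12)` is 3. The reversibility of XOR shows up as `xor4(12, 5)` being 9 and `xor4(5, 9)` being 12. Without the `0xF` mask, `~12` would also switch on every higher-order bit of the integer.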

Viewing the bitwise operations in base-10 is often inconvenient. Binary or base-2 is more convenient, but long strings of 1s and 0s are hard to read and easy to mistype, and C++ had no binary number notation until C++14 added the 0b prefix. Therefore, programmers typically denote binary numbers, especially constants, in hexadecimal (or occasionally in octal). A single hexadecimal digit corresponds to a nibble (i.e., to 4 bits). So, we can compactly specify each 4-bit cluster as a single hexadecimal digit.

Decimal (base-10)	Octal (base-8)	Hexadecimal (base-16)	Binary (base-2)
0	0	0x0	0000
1	01	0x1	0001
2	02	0x2	0010
3	03	0x3	0011
4	04	0x4	0100
5	05	0x5	0101
6	06	0x6	0110
7	07	0x7	0111
8	010	0x8	1000
9	011	0x9	1001
10	012	0xa	1010
11	013	0xb	1011
12	014	0xc	1100
13	015	0xd	1101
14	016	0xe	1110
15	017	0xf	1111

Forming bit patterns. We can form bit patterns with numerical values in any base, but hexadecimal is particularly convenient, especially for longer patterns. C++ directly supports decimal, octal, and hexadecimal notation (and, since C++14, binary notation with a leading 0b). You can recreate this table by beginning each column at zero and counting in the appropriate base, remembering to carry. Decimal numbers are undecorated, while octal and hexadecimal numbers begin with a leading 0 and 0x, respectively.
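As a small illustration of the notations in the table, the following sketch writes the value twelve in each base. The constant names are hypothetical, and the `0b` form assumes a compiler that supports C++14 or later.

```cpp
// Hypothetical constants for illustration: the same value, twelve,
// written in each notation that C++ accepts.
const unsigned dec_twelve = 12;      // decimal: no prefix
const unsigned oct_twelve = 014;     // octal: leading 0
const unsigned hex_twelve = 0xc;     // hexadecimal: leading 0x
const unsigned bin_twelve = 0b1100;  // binary: leading 0b (assumes C++14 or later)

// A longer pattern: each hexadecimal digit stands for one nibble.
const unsigned pattern = 0xA5;       // 1010 0101 in binary
```

All four constants hold the same bit pattern; only the spelling in the source code differs. Note that a leading 0 changes the meaning of a literal: `014` is twelve, not fourteen.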

Bitmasks and Bit Vectors

It's convenient to think of bits as switches: a 1 means the switch is on, and a 0 means it is off. Programmers typically use the bitwise-AND and bitwise-OR operators to set (switch on), reset (switch off), and test the bits stored in a multi-bit data structure. Programmers can implement the structures in many ways, but a simple common approach - the one illustrated here - treats the individual bits of an integer as a set of switches. There is no structural difference between an integer representing a number and one representing a set of bits - the only difference is how a program uses them. The term bit vector names an integer whose contents the program treats as a set of bits rather than as a number. The names bit fields, bit sets, bit maps, and bit strings are synonyms. A bitmask is a constant bit vector and is often symbolically named for convenience. Programs often use bitmasks to represent common or frequently used switch settings.

An image depicting bit-masks as a grate through which each bit must pass. Bits form the slots in the grate. For bitwise AND, 1s represent open slots in the grate that allow the bits to pass through unmodified, while 0s switch bits off, always outputting a 0 regardless of the input. — **Masks and bitwise-AND**. Programmers turn switches or settings off with the Bitwise-AND operator.

Both operands must be a 1 to produce a 1; any other combination produces a 0.

The bitwise-AND operator, `&`, masks out (switches off) the bits in a bit vector corresponding to zeros in the mask.

Imagine the bit-mask as a filter: 0s close the filter, blocking out the corresponding data bits and injecting 0s (closed arrow heads); 1s are open, passing the corresponding data bits through (open arrow heads).
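The filter idea can be sketched in a few lines of C++. The names below (`LOW_NIBBLE`, `mask_out`, `switch_off`, `is_set`) are hypothetical and chosen only for this illustration.

```cpp
// Hypothetical names for illustration (not from the text).
const unsigned LOW_NIBBLE = 0x0F;   // mask 0000 1111

// Keep the bits where the mask has 1s; force every other bit to 0.
unsigned mask_out(unsigned bits, unsigned mask) {
    return bits & mask;
}

// Switch off the bits where the mask has 1s by ANDing with the
// complement of the mask (the complement has 0s in those positions).
unsigned switch_off(unsigned bits, unsigned mask) {
    return bits & ~mask;
}

// Test whether any bit selected by the mask is on.
bool is_set(unsigned bits, unsigned mask) {
    return (bits & mask) != 0;
}
```

For the bit vector `0xCC` (1100 1100), `mask_out(0xCC, LOW_NIBBLE)` yields `0x0C` and `switch_off(0xCC, LOW_NIBBLE)` yields `0xC0`. Testing a single switch works the same way: `is_set(0xCC, 0x04)` is true, while `is_set(0xCC, 0x01)` is false.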

An image depicting bit-masks as a grate through which each bit must pass. Bits form the slots in the grate. For bitwise OR, 0s represent open slots in the grate that allow the bits to pass through unmodified, while 1s always output a 1 regardless of the input value. — **Masks and bitwise-OR**. Programmers turn switches or settings on with the bitwise-OR operator.

Both operands must be 0 to produce a 0; any other combination produces a 1.

The bitwise-OR operator, `|`, switches on the bits in a bit vector corresponding to ones in the mask.

Imagine the bit-mask as a generator: 0s in the bit-mask are open, passing through the corresponding data bits without changing them (open arrow heads). However, 1s in the mask generate or inject 1s in the result's corresponding bit position or column (closed arrow heads).
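A minimal sketch of the generator idea follows, assuming three symbolically named one-bit masks. The names `READ`, `WRITE`, `EXEC`, and `switch_on` are hypothetical and do not come from the text.

```cpp
// Hypothetical switch names for illustration; each mask has exactly one bit on.
const unsigned READ  = 0x1;  // 0001
const unsigned WRITE = 0x2;  // 0010
const unsigned EXEC  = 0x4;  // 0100

// Switch on the bits where the mask has 1s; leave every other bit unchanged.
unsigned switch_on(unsigned bits, unsigned mask) {
    return bits | mask;
}
```

Masks can be combined with `|` before they are applied, so `switch_on(0, READ | WRITE)` yields 3 (0011). Switching on a bit that is already on changes nothing: `switch_on(READ, READ)` is still `READ`.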

Bit-Shift Operators

The two bit-shift operators should look familiar to you, not because we have used them before, but because they are reused as the output and input operators introduced previously. Both operands are integers, and we will continue to view the left-hand operand in binary but will now view the right-hand operand in decimal. Both bit-shift operators treat the left-hand operand as a string of 1s and 0s and shift them left or right by the number of places indicated by the right-hand operand. Shifting may seem confusing but is easy to understand when illustrated with an example.

Shifting Left

The left shift operator, <<, moves the bits in an integer to the left. The right-hand operand specifies how many places to shift the bits. For example:

11001100 (base 2) << 2 (base 10) is 00110000 — **Left Shift Operator**. Two views of the left shift operator. The shift operation moves an integer's bits to the left. The operation discards the most significant bits, illustrated with strikeout characters, and opens spaces in the least significant positions.

The left-hand operand is shown in binary for clarity

The right-hand operand, shown in decimal, is the number of places to shift the left operand

The high-order bits shifted out on the left are discarded

The vacated or opened low-order bits on the right are filled with 0s (highlighted in yellow)
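The 8-bit example in the figure can be approximated in C++ with the fixed-width type `std::uint8_t`. This is a sketch under one assumption worth stating: C++ promotes small integer types to `int` before shifting, so the cast back to 8 bits is what discards the high-order bits. The function name `shift_left8` is hypothetical.

```cpp
#include <cstdint>

// Hypothetical helper: shift an 8-bit pattern to the left.
// The operand is promoted to int before the shift, so the cast back to
// std::uint8_t discards the bits shifted past the eighth position.
// Assumes places is smaller than the number of bits in an int.
std::uint8_t shift_left8(std::uint8_t bits, unsigned places) {
    return static_cast<std::uint8_t>(bits << places);
}
```

Calling `shift_left8(0xCC, 2)` yields `0x30`, which is 00110000 in binary and matches the figure. For small values, each place shifted left doubles the number, so `shift_left8(1, 3)` is 8.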

Shifting Right

The right shift operator, >>, is similar to the left shift operator but is a little more complicated. The right shift operator moves the bits in the left-hand operand to the right by the number of places specified by the right-hand operand. The operation shifts the bits out on the right side, discarding them as expected. However, how the operation fills the empty spaces on the left complicates the right shift operator.

Without programmer intervention, the underlying hardware determines how to fill the spaces vacated by the shift. (Before C++20, the C++ standard called such behavior implementation-defined; C++20 requires sign extension for signed operands.) Some hardware implements sign extension (i.e., it fills the empty spaces with a copy of the left-most bit), and some hardware does not (i.e., it fills the empty spaces with 0s).

Fortunately, programmers can intervene. In a signed integer (a number capable of storing negative and positive values), the highest-order bit is called the sign bit. Computers generally treat a number as negative when the sign bit is 1 and non-negative (i.e., zero or positive) when it is 0. Negative values are generally not needed when dealing with bit patterns, and so the easy "fix" is defining the integer as unsigned. (Using unsigned integers, variables, and constants with all the bitwise operators is common.) When the right shift operator's left operand is unsigned, it always fills the empty spaces on the left with 0s regardless of how the hardware behaves by default.

The following examples demonstrate the right shift operator with and without sign extension:

11001100 (base 2) >> 2 (base 10) is 00110011 — **Right Shift Operator**. Two views of the right shift operator *without sign extension*. The shift operation moves an integer's bits to the right. It discards the least significant bits, illustrated with strikeout characters, and opens spaces in the most significant positions. Three cases produce the illustrated results: (a) the left operand is unsigned, (b) the hardware does not perform sign extension, or (c) the original highest-order bit is a 0.

The left-hand operand and the result are shown in binary for clarity

The right-hand operand, shown in decimal, is the number of places to shift the left operand

The low-order bits on the right are discarded

The vacated or opened high-order bits on the left are filled with 0s (highlighted in yellow)

11001100 (base 2) >> 2 (base 10) is 11110011 — **Right Shift Operator with sign extension**. Two views of the right shift operator *with sign extension*. The shift operation moves an integer's bits to the right. The shift discards the least significant bits, illustrated with strikeout characters, and opens spaces in the most significant positions. However, how the computer fills the open spaces depends on a combination of circumstances. First, sign extension is a property of the underlying hardware and beyond program control. Sign-extending hardware *copies* the highest-order bit into the opened positions. The second circumstance is therefore the value of the highest-order bit, highlighted with orange, when the shift operation occurs. The program produces the illustrated result only when the left operand is signed, the hardware performs sign extension, and the highest-order bit is 1.

The left-hand operand and the result are shown in binary for clarity

The right-hand operand, shown in decimal, is the number of places to shift the left operand

The low-order bits on the right are discarded

The vacated or opened high-order bits on the left are filled with a copy of the original left-most bit (in orange)
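The two right-shift figures can be contrasted with a pair of small functions. This is a sketch: the function names are hypothetical, and the signed result depends on the implementation as described above (sign extension is typical, and C++20 requires it).

```cpp
#include <cstdint>

// Hypothetical helpers for illustration (not from the text).

// Unsigned left operand: the vacated high-order bits are always filled with 0s.
std::uint8_t shift_right_unsigned(std::uint8_t bits, unsigned places) {
    return static_cast<std::uint8_t>(bits >> places);
}

// Signed left operand: when the value is negative (sign bit is 1), a
// sign-extending implementation fills the vacated bits with 1s.
std::int8_t shift_right_signed(std::int8_t bits, unsigned places) {
    return static_cast<std::int8_t>(bits >> places);
}
```

The pattern 11001100 is `0xCC` when unsigned and -52 when read as a signed 8-bit two's-complement value. `shift_right_unsigned(0xCC, 2)` yields `0x33` (00110011). With sign extension, `shift_right_signed(-52, 2)` yields -13, whose 8-bit pattern is 11110011. When the sign bit is 0, both functions agree: `shift_right_signed(0x4C, 2)` is `0x13`.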

Bitwise Operators With Assignment

Earlier in the chapter, we saw that C++ allows a shorthand notation with arithmetic operators called "Operation With Assignment." We can also use this notation with the binary bitwise operators.

Operation With Assignment	Meaning
`V &= E`	`V = V & E`
`V |= E`	`V = V | E`
`V ^= E`	`V = V ^ E`
`V >>= I`	`V = V >> I`
`V <<= I`	`V = V << I`

Operation with assignment with bitwise operators. V is an integer variable (typically a bit vector), E is an integer-valued expression (often a bitmask), and I is an integer-valued expression (often a constant or variable) giving the number of places to shift.
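The shorthand forms in the table can be tried out one after another on a single variable. The function below is a hypothetical illustration; its name and the mask values are arbitrary choices, not taken from the text.

```cpp
// Hypothetical function for illustration: applies each bitwise
// operation with assignment in turn to the same variable.
unsigned apply_all(unsigned v) {
    v |= 0x0F;   // same as v = v | 0x0F   (switch on the low nibble)
    v &= 0xFC;   // same as v = v & 0xFC   (switch off the two lowest bits)
    v ^= 0xFF;   // same as v = v ^ 0xFF   (toggle the low eight bits)
    v <<= 2;     // same as v = v << 2     (shift left two places)
    v >>= 1;     // same as v = v >> 1     (shift right one place)
    return v;
}
```

Tracing `apply_all(0xC0)` step by step gives `0xCF`, `0xCC`, `0x33`, `0xCC`, and finally `0x66`. Because `v` is unsigned, the right shift fills the vacated position with 0.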