10.5.4. Copy Constructor

Operations That Copy Objects

To understand which functions we need to override, we must first understand which C++ statements copy an object and which functions perform the operation. Spotting an object copy is generally easy, but identifying the copying function is not, as the following figure demonstrates.

Functions Copying Objects	Compiler-Created Copy Functions
void person::f1(person p3) // pass by value { ... } person person::f2() // return by value { ... return person(...); } person person::f3() // return by value { person temp(...); ... return temp; }	person::person(person& p) // copy constructor { ... } person& person::operator=(person& p) // assignment operator { ... return *this; }
Statements Triggering Object Copies	Comments
person p1(p);	Copy Constructor: Creates a new person object, `p1`, by copying an existing object, `p`.
f1(p);	Copy Constructor: The copy constructor creates the parameter object, `p3` (see `f1` above left), by copying the argument object, `p`.
person p2 = p;	Copy Constructor: The assignment operator, `=`, notwithstanding, the statement creates a new object, `p2`, and the copy constructor builds it by copying an existing object, `p`.
p0 = p;	Assignment Operator: The assignment operator copies an existing object to another existing object. Any data stored in `p0` is overwritten.
p0 = f2();	Assignment Operator: `f2` returns an object by value, and the assignment operator copies it to an existing variable, `p0`.
person p4 = f2();	No copy!: When a program defines a new object, e.g. `p4`, and creates a new object in the `return` statement (see `f2`), C++ builds the object in function-call scope - directly in `p4` for this example.
person p5 = f3();	Copy Constructor: Unlike `f2`, `f3` creates and returns a temporary variable, which the copy constructor copies to `p5`. `f3` triggers two more function calls than `f2`: one constructor and one destructor.

Operations copying objects. The C++ compiler automatically creates two functions, a copy constructor and an assignment operator, to copy objects in a program. The figure illustrates three functions that utilize one of the copy functions and the statements calling them, but sometimes it's unclear which copy function a statement calls. The ellipses represent detail removed for simplicity. See person.cpp at the bottom of the page.

The Compiler-Created Copy Constructor

The compiler-created copy constructor is necessarily simple and general. Our first task is to understand what the copy constructor does and then explore how the compiler might implement it.

string name; int weight; double height;
(a)	(b)
person::person(person& p) { name = p.name; weight = p.weight; height = p.height; }	person::person(person& p) { memcpy(this, &p, sizeof(person)); }
(c)	(d)

Copying a "simple" object with the compiler-created copy constructor. A program copies or duplicates an object by allocating memory and calling a copy constructor to initialize the new object's member variables. The compiler-created copy constructor copies the values saved in the exiting object's member variables to the corresponding members of the new object.

The person class's member variables: UML class diagram and C++ code. Class developers don't include the copy constructor in the UML class diagram nor any C++ code - the compiler creates it automatically and transparently (i.e., invisibly).
A symbolic representation of an object copy - the function copies the values saved in one object to a new object.
A copy constructor effectively copies each member variable with an assignment operation. While this technique is easy to understand, it's inefficient and difficult for compiler-writers to implement.
Copying a block of memory is easier and more efficient. This approach does not require the compiler to "know" about individual members - only the new object's address and size. The memcpy function prototype is:
```
void* memcpy(void* dest, const void* src, size_t n);
```
- dest is the address receiving the data
- src is the address of the original data
- n is the number of bytes to copy
- void* denotes the address of typeless data (i.e., data of an unspecified type) and is the most generic kind of data in a C++ program (similar in many ways to an Object reference in a Java program). size_t is a type alias.

Caution

A fundamental object-oriented principle is that a class hides its implementation - the data it stores and how its functions manage it - from programmers. Consequently, we can't know how a specific C++ compiler implements its string class, but it likely points to an array allocated on the heap. The string class overloads the assignment operator to handle this implementation. But memcpy is a low-level operation that is "unaware" of pointers and is unable to duplicate heap data. See Figure 4(b) below.

Overriding The Copy Constructor

The previous discussion of object ownership in aggregation relationships alluded to situations where two or more objects in a program share another object. While programmers can establish the sharing while copying a "complex" object, we generally intend the copy operation to produce two distinct and independent objects. Independence implies that once the copy operations are complete, we can change either object without affecting the other. The compiler-created copy constructor produces independent objects when the original object is "simple," but producing independent "complex" objects requires programmers to override the compiler-created copy constructor.

string* name; int weight; double height;
(a)	(b)

Copying a "complex" object with the compiler-created copy constructor. This example illustrates what happens when a program copies a "complex" object using the compiler-created copy constructor.

This version changes the name member variable to a pointer, changing the person class from "simple" to "complex."
The compiler-created copy constructor accurately copies the original object's member variables, including its pointers. But the value saved in a pointer is an address. Consequently, the original and the copied objects save the same address in their respective pointer members - they point to the same part object - and are not independent. Changing the name in either object also changes the name in the other.

The compiler-created assignment operator behaves similarly, as seen in the next chapter.

Steps for overriding the copy constructor

The function's name, like all constructors, is the name of the class.
The function has exactly one parameter, which is an instance of the class, passed by reference.
Copy each non-pointer member variable by simple assignment or by using memcpy.
Copy each pointer member by allocating new memory with the new operator and copying the saved data from the original to the new object.

person::person(person& p) { name = new string(*p.name); weight = p.weight; height = p.height; }
(a)
person::person(person& p) { memcpy(this, &p, sizeof(person)); name = new string(*p.name); }
(b)	(c)

Overriding the copy constructor.

This version copies the original object member-by-member, a common approach for overriding a copy constructor. It's common to dereference any pointers before duplicating the pointer members, but this depends on the other constructors in the class.
When there are many non-pointer members, it is more efficient to do a byte-wise copy with memcpy first and then copy each pointer member individually. Note that the order of operations is important: memcpy must be done first, followed by the individual pointer copies. (Can you figure out why?)
When the copy operation is complete, the pointer members are also correctly copied, resulting in distinct and independent objects that do not share data.

If the original object has an embedded or composed part, the program must initialize it. Initialization is automatic if the object's class has a default constructor. Otherwise, the overridden copy function must explicitly call a general constructor. The Actor 3 example in the next section demonstrates this process.

	View	Download
No pointer member variables	person.cpp	person.cpp
One pointer member variable	person_pointer.cpp	person_pointer.cpp

View

Download

No pointer member variables

person.cpp

One pointer member variable

person_pointer.cpp

10.5.4. The Copy Constructor

Operations That Copy Objects

The Compiler-Created Copy Constructor

Overriding The Copy Constructor

Downloadable Code Examples