4. Variables, Values, Types and Templates#
In this chapter we will finally begin to write some code. We will start to develop the semantics and grammar for the langauge that will ultimately allow you to author your own projects. To begin, let us start off with an anology.
If C++ code is a recipe, then the variables are the ingredients (for the sake of argument, let’s assume the recipe is for a cake). Variables consist of a type and a value. Continuing our cake analogy, the type specifies which ingredient (eggs, flour, sugar), while the value specifies how much. One cannot talk about types without specifying values, but I can’t talk about values without know what the type is. This remark is central to how C++ thinks of variables:
Variables are containers that only accept values corresponding to its type.
This is true for functions (link functions), classes (link classes), and enumerations (link enumerations) (all concepts we will introduce in subsequent chapters).
4.1. Types#
C++ is a strongly typed-language. In the context of variables, this means that every variable has to have a type. Moreover, anywhere the variable is used in the code base, it needs to have a corresponding type. There essentially to types of type declarations in C++: intrinsic types and user defined tyeps.
4.1.1. Intrinsic types#
Intrinsic types are also sometimes called primitive data types. They are simple to represent on computers, and arguably, they are the reason why computers exist. These are essentially the types the C++ has inherited directly from C. Here we will summarize the intrinsic types and their common use case
declaration |
description |
---|---|
|
This is an incomplete type that is completely empty. You cannot declare void variables. You can declare functions with void return values (link return values) and void pointers (link pointers). |
|
Represents a single ASCII character, and occupies 1 byte (8 bits) of memory.
ASCII characters are really just integers under the hood; the |
|
Represents a UTF encoded characters, and cccupies 4 bytes (32 bits) of memory. |
|
(C++20) an 8-bit char |
|
a 16-bit char |
|
a 32-bit char |
|
The basic integer type. Stores a 32-bit integer |
|
A 16-bit integer |
|
A 64-bit integer |
|
An integer determined by the hardware you have, for 64-bit systems, this is generally a 64-bit integer |
|
a 32-bit decimal number |
|
a 64-bit decimal number. Here large storage means that one can keep track of more decimal places, but at the expense of operations taking longer |
|
an 8-bit integer that can only be |
Integer types can be modified with the keywords signed
and unsigned
to indicate whether the variable can store negative and non-negative numbers, respectively
These types are discussed in more detail here on cppreference.com, especially how they can be combined to produced new intrinsic data types.
The combinations of fundamental types, e.g., unsigned long long
, specify the size (bytes in memory) the integer should occcupy.
This also determines the largest integer the corresponding variables can contain.
The size of any type can be interrogated through using the sizeof()
function.
An example program that prints the size selected primitive types can be found here
For convenience, we include a screenshot
4.1.2. User-defined types#
User-defined types are precisely as they sound.
They are declared by creating classes or structs (link these), but we will postpone the precise details of how this works for later.
The one user-defined type we will mention here is the std::vector
container.
This container stores a user-specified number of variables of a user-specified type.
For example, it could 1000 floats
all within the same variable.
4.1.3. Variable declaration#
We finally come to the first semantic rule for writing C++ code: variable declaration.
Rule: Variable Declaration
At its core, variable declaration has the following form.
type_declaration variable_name
type_declaration
is replaced by a intrinisic type (int
, float
, etc) or user-defined type (std::vector
).
variable_name
is replaced by a string of characters that satisfy the followin
Rule: Variable Names
Must start with a letter or underscore,
a-z
,A-Z
,_
Can only contain a number, letter, underscore or
$
The variable name cannot have any whitespace
Examples
int count; // Valid
double _fraction; // Valid
bool FLAG_1; // Valid
long 2_file; // Compiler error
float a b; // Compiler error
4.2. Values#
As mentioned before, the value of a variable defines how much of an ingredient (type) it contains. Assigning a value to a variable is called initialization. Assigning values to variables that are already initialized is called assignment. Initialization typically happens at the same time of declaration, e.g.,
double PI; // Declaration only
int cpu_core_count{ 12 }; // Declaration and initialization
cpu_core_count = 4; // Assignment
C++ reasons about different values through discussing value categories. The two primary value categories are rvalues (read as R-values) and lvalues (read as L-values). This distintion exists because in C++ we have to think about memory management, resource acquistion and other semantic constructs within the language. It turns out that if the compiler expects an rvalue, but it given a lvalue, the compiler will throw a compilation error. We will now discuss the nuance behind their distinction
4.2.1. Lvalues#
For historical reasons, the L refers to the variable appearing on the left-hand-side of the assignment operator =
.
According the ISO standard,
an lvalue is an expression whose evaluation identifies an object or function. Object here means variable name, while function refers to a function’s name. In general, we will think of lvalues as initialized variables.
4.2.2. Rvalues#
For historical reasons, the R refers to the expression appearing on the right-hand-side of the assignment operator =
.
One thing to consider alreay is that in the following example
int foo{ 10 }; // Could be written as foo = 10;
int baz{ foo }; // Could be written as baz = foo;
both foo
and baz
are lvalues, but 10
is an rvalue.
Another example of an rvalues is the return value of a function (there are caveats here that we will ignore for now).
4.2.3. Value categorities derived from l- and rvalues#
In recent years, the C++ standard has expanded the value categories to include prvalue, xvalue, and glvalues to help delineate between the roles that values can play. Those more interested in the precise taxonomy the C++ language is using now should read the cppreference article detailing it
4.3. Delcaration qualifiers#
Variable declarations can be modified for extra functionality, we will refer to this as type specifiers.
Three common specifiers that we will encounter are static
, const
and constexpr
.
Less common specifiers include extern
and volatile
.
We will not go throught the last two, as, IMHO, these are only really useful when coding for low level systems such as embedded systems.
Are usecases are simulation based.
4.3.1. The static
specifier#
static int foo{ 100 };
The static
specifier tells the compiler to not make the variable (or function) visible outside of the .CPP file in which it appears.
This could be useful if there are variables you want to hide from other parts of the program.
This is a remanent of the C programming language; modern C++ address this problem with encapsulation.
However, we mention the specifier here because it does appear frequently in C++ code, but for a different reason (see the warning to the right).
An example of its usage is
4.3.2. The const
specifier#
const int a{ 1 };
int const a{ 1 }; // is equivalent to the above
a = 2; // Compile-time error
This specifier indicates that the value of the variable cannot be change after initialization.
4.3.3. The constexpr
specifier#
constexpr int a{ 1 };
a = 2; // Compile-time error
This is a stronger specifier than const, it tells the compiler that the variable’s value is known at compile time, and will not change for the duration of the program. This is particularly useful when you want to offload certain computations to the compiler, and can provide a dramatic improvement in speed of the program (up 10,000x).
4.4. Templates#
The last topic, and by far the most complex, is the template semantic. Templates are what truly distinguish C++ from C, and allow you to write code that is generic in the type you want to use. There utility is far more visible for functions and class declarations, but we are going to give an incremental introduction.
We come to our last rule of the chapter, how to declare (and use) variables with template parameters.
Rule: Templated Variable Declaration
template<typename T>
int size;
// Equivalently
template<typename T> int size;
Here template
and typename
are reserved words, while we are free to replace T
with anything we want (as long as it obeys the name of variable declarations, link these).
When dealing with templates, the declaration exist completely separately from the initialization. This is because, we also need to the instantiate the template parameter. For templated variable declarations, the only way to instantiate the template is through template specialization.
Rule: Template Variable Specialization
template<typename T>
int size;
// Specializations
template<> int size<bool> = 1;
template<> int size<int> = 4;
template<> int size<double> = 8;
Even though all variables have the same name, they are distinguished by their template instantiation, which is good enough for the compiler to distinguish them. There are not many use-cases that I can think of for this particular semantic, besides some analogy of the example provided. An example program with this code can be found here. Note, that template variables are identified by the presence of the a variable name followed by angled brackets with a type declaration inside.
4.5. Summary#
To summarize, we introduced variables, their declaration (which require types), there initializations (which requires values) and templates which treat types as values. For convenience the rules in green boxes are reproduced below for future reference
Rules for Variable Declarations
Variable declarations follow the syntax
type_specifier type_declaration variable_name
Allowed expressions are
type_specifier
: (optional)static
,const
, orconstexpr
.type_declaration
: intrinsic types, such asint
,float
orbool
, and user-defined types such asstd::vector
variable_name
: following the three naming rules above
Templated variable declarations follow the syntax
template<typename template_name>
type_specifier type_declaration variable_name
where template_name
has to satisfy the same naming rules for variable declarations.
Templated variable declarations have to specialized for concrete types following the syntax
template<> type_specifier type_decalaration variable_name<concrete_type> = value