Digital computers don't understand the languages that humans speak. Instead, they use a binary language called machine code or machine language. Machine code consists of a sequence of simple computer instructions, with each instruction expressed as a string of binary digits or bits (i.e., 1's and 0's). Different computers typically "speak" or "understand" different machine languages. For example one computer may represent the ADD operation as
10011111 while another might represent the same operation as
000110. The size of machine code instructions may also vary from one computer to another: 32-bit instructions are still in use, but 64-bit instructions are now the most common.
Furthermore, when a program runs, the operating system (e.g., Windows, Linux, macOS, etc.) acts as a host environment that provides services to the program. These services include essential support such as access to the keyboard, screen, and hard drives. Unfortunately, how the program accesses those services is also different from one operating system to the next. As a result of the differences between machine languages and operating system requirements, programs written in machine language are more focused on the system running the program than on how the program solves a problem. Furthermore, it also means that it isn't possible to move machine code between different computers without providing a translation service - usually in the form of a virtual machine.
Writing programs in machine language is slow, tedious, and error-prone. Today, we write most programs in higher-level programming languages that focus more on the problem and less on the system (the hardware and the operating system) running the program (see Figure 2). But computers can't (with rare exceptions) directly execute programs written in high-level languages, so there must be some way of translating a program written in a high-level language into machine language. Two kinds of computer programs perform the necessary translation: compilers and interpreters.
A compiler is a program that translates other programs written in a high-level programming language like C or C++ into machine code or machine language. Some languages such as Java and C# take a different route. Compilers for these languages translate the high-level source code into an intermediate form (a representation that lies somewhere between the high-level and true machine code) called virtual machine code. The virtual machine code then becomes the input to another program called an interpreter or virtual machine (VM), a program that simulates a hardware CPU.
Python programs have some fully interpreted parts, but these parts are a small portion of the overall program and have minimal impact on the overall runtime. The Python libraries, where the program spends most of its time, are written in C and run very fast. So, Python programs run almost as fast as an equivalent C program.
On the other hand, once we compile a program written in purely compiled languages, we can't easily move the resulting executable machine code to a different platform (e.g., you can't run a Windows program to an Apple computer). In contrast, we can easily move programs we write in interpreted languages between different computers.
Interpreted programs are portable because they run on a VM or interpreter. From the hardware and operating system's perspective, the interpreter is the running program. We write interpreters and VMs in purely compiled languages, so they are not portable, but the programs that they run are. Once we install the interpreter on a system, we can move interpretable programs to the system and run them without further processing.