Rust Documentation: Chapter 4 - Ownership

In the previous chapter, we saw how to use data types, functions, and control flow in Rust. In this chapter, we will look at Rust's most distinctive feature: ownership.

Ownership

Ownership enables Rust to make memory safety guarantees without needing a garbage collector, so it’s important to understand how ownership works. In this chapter, we’ll talk about ownership as well as several related features: borrowing, slices, and how Rust lays data out in memory.

Stack and Heap

The stack suits values whose size is known and fixed at compile time. It stores values in last in, first out (LIFO) order. The heap suits values whose size is unknown at compile time or might change. The heap is less organized: when we put data on it, the memory allocator finds an empty spot that is big enough, marks it as in use, and returns a pointer to that location.

Pushing to the stack is faster than allocating on the heap because the allocator never has to search for a place to store new data; that location is always at the top of the stack. Comparatively, allocating space on the heap requires more work because the allocator must first find a big enough space to hold the data and then perform bookkeeping to prepare for the next allocation.

When we call a function, the values passed to it (including, potentially, pointers to data on the heap) and the function's local variables are pushed onto the stack. When the function is over, those values are popped off the stack. This is why the size of every parameter of a function must be known at compile time. Usually, variables of a type with a known, fixed size, e.g., i32, u32, f32, are stored on the stack.

For data on the heap, we usually keep a pointer to the location in memory where the data is stored. The pointer itself has a known, fixed size, so it can be stored on the stack.
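As a minimal sketch of the difference, the example below puts one value on the stack and one on the heap. It uses Box, a standard-library type that this chapter does not otherwise cover, only as a convenient way to force a heap allocation; the function name stack_and_heap_sum is made up for illustration.

```rust
// Hypothetical helper: adds a value stored on the stack to a value stored on the heap.
fn stack_and_heap_sum() -> i32 {
    // An i32 has a fixed size known at compile time, so it lives on the stack.
    let on_stack: i32 = 40;

    // Box::new allocates space for the value on the heap. `on_heap` itself is
    // a fixed-size pointer, and that pointer lives on the stack.
    let on_heap: Box<i32> = Box::new(2);

    // Dereferencing the box follows the pointer to the data on the heap.
    on_stack + *on_heap
}

fn main() {
    println!("{}", stack_and_heap_sum()); // prints 42
}
```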

Ownership Rules

In Rust, the compiler checks a few rules to ensure memory safety. These are:

Each value in Rust has an owner.
There can only be one owner at a time.
When the owner goes out of scope, the value will be dropped.

Memory and Allocation

In Rust, when we use the String type, we might not know at compile time how much memory to allocate for it, since the value that a String holds can change later. We can see this in the example below:

    let mut s = String::from("hello");

    s.push_str(", world!"); // push_str() appends a literal to a String

    println!("{}", s); // This will print `hello, world!`

We can see that we can append string literals to a String. However, a string literal itself cannot be modified this way: its contents are known at compile time and are hardcoded into the final executable. The difference comes from the way Rust handles memory and allocation for the String type.

With the String type, in order to support a mutable, growable piece of text, we need to allocate an amount of memory on the heap, unknown at compile time, to hold the contents. This means:

The memory must be requested from the memory allocator at runtime.
We need a way of returning this memory to the allocator when we’re done with our String.

The String::from function does all the memory allocation for us. However, once we have requested memory, we need to keep track of when that memory needs to be freed up.
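To make the runtime allocation visible, here is a small sketch that starts a String with a deliberately small capacity and then appends to it. The helper name grow is an assumption for illustration. The exact capacities depend on the allocator's growth strategy, so the code only relies on the capacity being at least as large as the length.

```rust
// Hypothetical helper: appends `extra` to a String that starts out with room
// for only a few bytes. Returns (final length, capacity before, capacity after).
fn grow(extra: &str) -> (usize, usize, usize) {
    // Request room for at least 5 bytes from the allocator at runtime.
    let mut s = String::with_capacity(5);
    s.push_str("hello");
    let before = s.capacity();

    // If `extra` does not fit, the String requests a larger buffer.
    s.push_str(extra);
    (s.len(), before, s.capacity())
}

fn main() {
    let (len, before, after) = grow(", world!");
    println!("len = {}, capacity before = {}, capacity after = {}", len, before, after);
}
```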

In Rust, the memory is automatically returned once the variable that owns it goes out of scope. This can be seen in the example below:

    {
        let s = String::from("hello"); // s is valid from this point forward

        // do stuff with s
    }                                  // this scope is now over, and s is no longer valid

When a variable goes out of scope, Rust automatically calls a special function named drop, which frees the memory.
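One way to observe this is to implement the standard Drop trait on a type of our own. The sketch below is illustrative only: the Tracked type and the drop_order function are hypothetical, and the code uses a reference and a RefCell (neither covered in this chapter) purely to record the order in which values are dropped.

```rust
use std::cell::RefCell;

// Hypothetical type for illustration: it writes its name into a shared log
// when it is dropped.
struct Tracked<'a> {
    name: &'static str,
    log: &'a RefCell<Vec<&'static str>>,
}

impl Drop for Tracked<'_> {
    // Rust calls this automatically when a Tracked value goes out of scope.
    fn drop(&mut self) {
        self.log.borrow_mut().push(self.name);
    }
}

// Returns the names in the order in which the values were dropped.
fn drop_order() -> Vec<&'static str> {
    let log = RefCell::new(Vec::new());
    {
        let _outer = Tracked { name: "outer", log: &log };
        {
            let _inner = Tracked { name: "inner", log: &log };
            // `_inner` is valid until the end of this block
        } // `_inner` goes out of scope here, so it is dropped first
    } // `_outer` goes out of scope here, so it is dropped second
    log.into_inner()
}

fn main() {
    println!("{:?}", drop_order()); // prints ["inner", "outer"]
}
```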

Variable and Data Interaction with `Move`

In Rust, we can assign a String to a different variable using let as follows:

    let s1 = String::from("hello");
    let s2 = s1;

When we do this, we might expect Rust to make a second copy of the String's contents, so that s1 and s2 each hold their own "hello". However, this is not the case.

A String is made up of three parts: a pointer to the memory that holds the contents of the string, a length, and a capacity. This group of data is stored on the stack. The memory that the pointer refers to is stored on the heap.

Figure: Memory Allocation
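The three parts described above can also be inspected from code. This is a minimal sketch; the helper name string_parts is made up for illustration, and the printed address and stack size vary by platform, so no specific values are shown for them.

```rust
// Hypothetical helper: returns the (length, capacity) of a String built from `text`.
fn string_parts(text: &str) -> (usize, usize) {
    let s = String::from(text);
    (s.len(), s.capacity())
}

fn main() {
    let s = String::from("hello");

    // The pointer refers to the heap memory that holds the contents.
    println!("pointer:  {:p}", s.as_ptr());
    // The length is how many bytes of contents the String currently uses.
    println!("length:   {}", s.len());
    // The capacity is how many bytes the String has received from the allocator.
    println!("capacity: {}", s.capacity());

    // The stack part has a fixed size that does not depend on the text.
    println!("size on the stack: {} bytes", std::mem::size_of::<String>());
}
```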

When we assign s1 to s2, the String data is copied, meaning we copy the pointer, the length, and the capacity that are on the stack. We do not copy the data on the heap that the pointer refers to.

Figure: Making a copy of the data

When we assign s1 to s2, no new copy of the heap data is created: both variables hold a pointer to the same memory. In short, Rust does not create a deep copy when we assign one variable to another.

`Double free` memory error

Now that we have s1 and s2 pointing to the same location in memory, this can potentially cause problems. For example, when both variables go out of scope, they will both try to free the same memory. This is known as a double free error.

To avoid such errors, Rust considers s1 no longer valid once we assign s1 to s2. This is known as a move. A move resembles a shallow copy: s2 does not copy the heap data, but takes ownership of the data that s1 was pointing to, and s1 is invalidated. We say that s1 was moved into s2.
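A short sketch of a move, with the invalid use of s1 left as a comment so that the example still compiles. The function name move_keeps_buffer is hypothetical; it compares the heap address before and after the move to show that the heap data itself is not copied.

```rust
// Hypothetical helper: moves a String into a new variable and reports whether
// the heap buffer is still at the same address afterwards.
fn move_keeps_buffer() -> bool {
    let s1 = String::from("hello");
    let address_before = s1.as_ptr();

    let s2 = s1; // s1 is moved into s2; s1 is no longer valid

    // println!("{}", s1); // would not compile: `s1` was moved

    // Only the pointer, length, and capacity were copied, so s2 points to the
    // same heap memory that s1 pointed to.
    address_before == s2.as_ptr()
}

fn main() {
    println!("{}", move_keeps_buffer()); // prints true
}
```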

Important Note: Rust never creates a deep copy of a variable automatically. To create a deep copy, we need to use the clone method.

`Clone` method

To make sure that a deep copy is created when we assign s1 to s2, we can use the clone method. It copies the heap data into a new allocation, so s2 gets its own pointer, length, and capacity on the stack, and both s1 and s2 remain valid. We can clone a variable as follows:

    let s1 = String::from("hello");
    let s2 = s1.clone();
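To check that clone really produces a separate copy, the sketch below compares both the contents and the heap addresses of the two strings. The function name clone_makes_new_buffer is made up for illustration.

```rust
// Hypothetical helper: clones a String and returns
// (are the contents equal?, do both point to the same heap buffer?).
fn clone_makes_new_buffer() -> (bool, bool) {
    let s1 = String::from("hello");
    let s2 = s1.clone(); // deep copy: s2 gets its own heap allocation

    // Both variables are still valid, so both can be used here.
    (s1 == s2, s1.as_ptr() == s2.as_ptr())
}

fn main() {
    println!("{:?}", clone_makes_new_buffer()); // prints (true, false)
}
```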

`Copy` trait

Values of types whose size is fixed at compile time, such as integers and booleans, are stored entirely on the stack. When such a variable is assigned to a new variable, a copy is performed instead of a move: the value is copied to the new variable, and both variables stay valid. This may look like a contradiction of the move behavior, but it is a deliberate design choice: copying a small value on the stack is cheap, so there is no difference between a shallow and a deep copy here. Rust marks such types with the Copy trait.

By default, the following types implement the Copy trait:

All the integer types, such as u32.
The boolean type, bool.
The floating point types, such as f64.
The character type, char.
Tuples, if they only contain types that also implement Copy. For example, (i32, i32) implements Copy, but (i32, String) does not.
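The tuple rule from the list above can be sketched as follows. The function name copy_and_move is hypothetical, and the line that would fail to compile is left as a comment.

```rust
// Hypothetical helper: contrasts a tuple that is Copy with one that is not.
fn copy_and_move() -> (i32, usize) {
    // (i32, i32) implements Copy, so assignment copies the value.
    let a: (i32, i32) = (1, 2);
    let b = a;
    let sum = a.0 + a.1 + b.0 + b.1; // both a and b are still valid

    // (i32, String) does not implement Copy, so assignment moves the value.
    let c: (i32, String) = (3, String::from("hello"));
    let d = c;
    // println!("{}", c.0); // would not compile: `c` was moved into `d`

    (sum, d.1.len())
}

fn main() {
    println!("{:?}", copy_and_move()); // prints (6, 5)
}
```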

Ownership and Functions

When we pass a variable to a function, the value is either moved or copied into the function, depending on the variable's type, just as with assignment. This can be seen in the example below:

fn main() {
    let string_variable = String::from("hello"); // 'string_variable' comes into scope

    show_string(string_variable); // 'string_variable' is moved into the function
                                  // and is no longer valid here

    let x = 5; // 'x' comes into scope

    show_number(x); // 'x' is copied into the function instead of moved,
                    // so 'x' is still valid after the function call
} // 'x' goes out of scope; 'string_variable' was moved, so nothing happens

fn show_string(string_variable: String) { // 'string_variable' comes into scope
    println!("{}", string_variable);
} // 'string_variable' goes out of scope and the memory is freed via 'drop'

fn show_number(x: i32) { // 'x' comes into scope
    println!("{}", x);
} // 'x' goes out of scope, but nothing special happens, since there is no 'drop' to be done

Ownership in `return` values and scopes

Ownership works the same way for function return values as it does for variables. When a function returns a value, the ownership of the value is either moved or copied to the variable that is receiving the return value. This can be seen from this example:

fn main() {
    let s1 = gives_ownership();         // gives_ownership moves its return
                                        // value into s1

    let s2 = String::from("hello");     // s2 comes into scope

    let s3 = takes_and_gives_back(s2);  // s2 is moved into
                                        // takes_and_gives_back, which also
                                        // moves its return value into s3
} // Here, s3 goes out of scope and is dropped. s2 was moved, so nothing
  // happens. s1 goes out of scope and is dropped.

fn gives_ownership() -> String {             // gives_ownership will move its
                                             // return value into the function
                                             // that calls it

    let some_string = String::from("yours"); // some_string comes into scope

    some_string                              // some_string is returned and
                                             // moves out to the calling
                                             // function
}

// This function takes a String and returns one
fn takes_and_gives_back(a_string: String) -> String { // a_string comes into
                                                      // scope

    a_string  // a_string is returned and moves out to the calling function
}

Conclusions

The ownership of a variable follows the same pattern every time: assigning a value to another variable moves it. When a variable that includes data on the heap goes out of scope, the value will be cleaned up by drop unless ownership of the data has been moved to another variable.

Drawbacks: Functions that take ownership of their arguments can be cumbersome. For example, if we want to use a variable after it has been passed to a function, we cannot do so, since the variable is no longer valid. To overcome this, we can use references, which are explained in the next chapter.
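Without references, one workaround is to have the function hand ownership back alongside its result. The sketch below shows how awkward that gets; the function name calculate_length is chosen for illustration only.

```rust
// Hypothetical helper: takes ownership of a String and returns it together
// with its length, so that the caller can keep using the String.
fn calculate_length(s: String) -> (String, usize) {
    let length = s.len();
    (s, length) // ownership of `s` moves back to the caller
}

fn main() {
    let s1 = String::from("hello");

    // s1 is moved into the function, and the String comes back as s2.
    let (s2, len) = calculate_length(s1);

    println!("The length of '{}' is {}.", s2, len); // prints The length of 'hello' is 5.
}
```

Every function that only needs to read a value would have to return it like this, which is the ceremony that references are meant to remove.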
