Operating Systems Notes

If you are struggling to find the right track and want a 10-minute read covering Operating Systems from beginning to end, you're at the right place. Even I am not sure where to start, but we will figure it out. If you are reading this, I have either completed these notes or am at least on the right track.

Let's start our struggle for OS:

The Track

I found many resources to learn from; let's list them all: [Galvin Concise PPTs](https://www.os-book.com/OS9/slide-dir/index.html): These were great, but I felt they were a little too much, so here are the chapters we will cover:

Processes
Threads
Process Synchronization
CPU Scheduling Algorithms
Deadlocks
Main memory
Virtual memory
Virtual Machines

I am skipping the introduction to OS for now as it is not that important. This is going to be a fast article that works like a last-night dose.

A simple introduction (may skip if you are already familiar)

An operating system is like an interface between users and hardware. It is a sort of manager that manages multiple applications. Whether you are headshotting someone in Valorant or making your own programming language, you are actually interacting with the OS. But why do we need an OS? The main reasons are resource allocation, protection, and, of course, abstraction. We do not need an OS for a lift, because it has just one function, but we do need one for a PC because of the complex tasks we expect a PC to perform. These are the types of operating systems, based on the functionality the OS provides:

Single Tasking: MS-DOS (only one task at a time). These are very inefficient. The main reason is that the CPU is very fast compared to IO devices, so a process waiting on an IO device holds up the queue for other processes and keeps the CPU idle.
Multiprogramming and Multitasking: This is the solution to the above problem. We do not wait for a process to finish executing; when it starts an IO operation we take it off the CPU and in the meanwhile assign some other process. Multitasking differs from multiprogramming in that each process gets a small fixed time slice, after which the processes change turns. It is more responsive than multiprogramming because no process can hold the CPU indefinitely. It's the same concept as Round Robin Scheduling.
Multithreading: It's a kind of advancement over Multiprogramming. Here we have multiple threads running in an interleaved fashion inside a process. The foremost advantage is increased responsiveness. Consider MS Word, the tool you use to make those assignments. Here one thread is formatting the content while another takes input from you. These two threads are involved in just typing something in MS Word. A thread is the smallest unit that can be assigned to the CPU. Also, we saw that we switch between processes in multitasking, and switching has a cost. This cost is lower for thread switching than for process switching. We will see this under context switching.
Multiprocessing: While buying a laptop in the Diwali sales (if you are in India), you might have noticed that more processor cores increase the price. We have quad-core laptops at 30-45K INR and octa-core ones at 50K+. Such a laptop has more than one processing unit and distributes processes among them, so several processes can run at the same time. This obviously boosts the laptop's speed.
Multi-User: Earlier versions of Windows didn't support multiple users, but Linux has always had this support.

Processes

A process is a program in execution. It has its own memory, which is divided into four parts: Text, Data, Heap, and Stack.

Brief explanation about these

A process, as the name suggests, goes through different states (just like making a joint). Here is a diagram which says it all:

Diagram

Process Control Block(PCB)

It is a data structure that stores all information about a process.

It contains these:

Process Scheduling Queues

This handles the order of execution of processes. We have different queues for different states. When the state of a process changes, it is simply detached from its current queue and added to the queue of its new state.
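A minimal sketch of that detach-and-append step, assuming one queue of process ids per state. The names QueueSet, State and move are made up for illustration and do not come from any real kernel.

```cpp
#include <algorithm>
#include <cassert>
#include <deque>
#include <map>

// Hypothetical, simplified set of states.
enum class State { Ready, Running, Waiting };

// One queue of process ids per state.
struct QueueSet {
    std::map<State, std::deque<int>> queues;

    // Put a process at the back of the queue for state s.
    void add(int pid, State s) { queues[s].push_back(pid); }

    // Detach pid from the queue of its current state and append it
    // to the queue of its new state.
    void move(int pid, State from, State to) {
        std::deque<int>& q = queues[from];
        auto it = std::find(q.begin(), q.end(), pid);
        assert(it != q.end() && "pid must be in the source queue");
        q.erase(it);
        queues[to].push_back(pid);
    }
};
```

For example, after adding processes 1 and 2 to the Ready queue, move(1, State::Ready, State::Running) leaves only 2 in Ready and puts 1 in Running.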

This says it all

Schedulers

There are three types of schedulers: long-term, short-term, and mid-term.

Brief explanation

Interrupts and Context Switch

An interrupt is like a signal to stop the current process and let a higher-priority process (the one causing the interrupt) execute. But what happens to the process the CPU was executing? It gets saved! When an interrupt occurs, the system saves the context of the process currently running on the CPU so that it can restore that context later. It's like a bookmark we use while reading novels. For example, if you are reading a book on data structures and algorithms and suddenly realize that watching that 10-minute Avengers scene would be far more interesting, you place a bookmark and return after 2 hours to resume where you left off.

The context is saved in the PCB

Also, please note that taking up a new process requires saving the context of the current process and restoring the context of the incoming process. This is known as a context switch.
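Here is a rough sketch of the bookmark idea, assuming the context is just a program counter and four registers. The Context, PCB and CPU structs are hypothetical; a real kernel saves far more state and does it in architecture-specific code.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Simplified CPU context: a program counter plus a few registers.
struct Context {
    std::uint64_t pc = 0;
    std::array<std::uint64_t, 4> regs{};
};

struct PCB {
    int pid = 0;
    Context saved;  // the "bookmark" kept while the process is off the CPU
};

struct CPU {
    Context ctx;             // what the hardware is executing right now
    PCB* running = nullptr;  // process that currently owns the CPU
};

// Save the outgoing process's context into its PCB, then restore the
// incoming process's context from its PCB.
void context_switch(CPU& cpu, PCB& next) {
    assert(cpu.running != &next);  // this sketch only models a real switch
    if (cpu.running != nullptr) {
        cpu.running->saved = cpu.ctx;
    }
    cpu.ctx = next.saved;
    cpu.running = &next;
}
```

If process A runs for a while and is then switched out for B, switching back to A later restores exactly the program counter A had when it was interrupted.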

Inter-Process Communication

Inter-Process Communication or IPC for short is used for transferring information from one process to another.

Why do we need IPC?

We have studied PCBs, and we know that process information is stored in them. To transfer information between processes we need some sort of temporary variable (like the one we use to swap two numbers). There are two types of mechanisms based on this principle.

Different IPC mechanisms

Scheduling Algorithms

Terminology

Let's start with algorithms.

First Come First Serve(FCFS)
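As a quick illustration, FCFS simply runs processes in order of arrival, each one to completion. The sketch below assumes each process is given as an arrival time and a burst time; the names Proc, Averages and fcfs are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

struct Proc { int arrival; int burst; };
struct Averages { double waiting; double turnaround; };

// Runs processes in arrival order (ties keep their input order) and
// returns the average waiting and turnaround times.
Averages fcfs(std::vector<Proc> procs) {
    assert(!procs.empty());
    std::stable_sort(procs.begin(), procs.end(),
                     [](const Proc& a, const Proc& b) { return a.arrival < b.arrival; });
    long long t = 0, total_wait = 0, total_turnaround = 0;
    for (const Proc& p : procs) {
        t = std::max<long long>(t, p.arrival);  // CPU may sit idle until arrival
        t += p.burst;                           // completion time of p
        const long long turnaround = t - p.arrival;
        total_turnaround += turnaround;
        total_wait += turnaround - p.burst;
    }
    const double n = static_cast<double>(procs.size());
    return {total_wait / n, total_turnaround / n};
}
```

With bursts 24, 3 and 3 all arriving at time 0, the two short jobs wait behind the long one, giving an average waiting time of 17 and an average turnaround time of 27.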

Shortest Job First(SJF)

Just sort the jobs by their burst times and schedule them in that order. This is a non-preemptive algorithm, but a preemptive version of SJF exists as well.

C++ Code for non-preemptive SJF

#include<bits/stdc++.h>
using namespace std;
#define int long long int
int32_t main()
{
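    int n;cin>>n;
    // Each process is read as two numbers: times[i].first is used as its
    // arrival time, and its burst time is times[i].second-times[i].first.
    vector<pair<int,int>> times(n);
    // duration[i] = {burst time, original index of the process}
    vector<pair<int,int>> duration(n);
    for(int i=0;i<n;i++)
    {
        cin>>times[i].first>>times[i].second;
        duration[i].second=i;
        duration[i].first=times[i].second-times[i].first;
    }

    // Shortest burst first (ties broken by original index)
    sort(duration.begin(),duration.end());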

    // Order of jobs is the duration.second order
    // Now we need to find turnaround time, waiting time and
    // Completion time for every process
    int total_waiting_time=0;
    int total_turnaround_time=0;
    int t=0;
    for(int i=0;i<n;i++)
    {
        cout<<"Process "<<duration[i].second+1<<" in ready queue now!\n";
        t=max(times[duration[i].second].first,t);
        t+=duration[i].first;
        int turnaround_time=t-times[duration[i].second].first;
        total_turnaround_time+=turnaround_time;
        cout<<"Turn Around Time: "<<turnaround_time<<"\n";
        int waiting_time=turnaround_time-duration[i].first;
        total_waiting_time+=waiting_time;
        cout<<"Waiting Time: "<<waiting_time<<"\n";
    }
    double avg_turnaround_time=(double)total_turnaround_time/n;
    double avg_waiting_time=(double)total_waiting_time/n;
    cout<<"Average Waiting Time: "<<avg_waiting_time<<"\n";
    cout<<"Average Turn Around Time: "<<avg_turnaround_time<<"\n";


    return 0;
}

This Gantt diagram will help you understand SJF:

Gantt diagram

Disadvantages of SJF

It has a preemptive variant as well. Here, the trick is to fill the gaps we were leaving: whenever the CPU is idle, we assign it another process that fits in that gap.

C++ code for preemptive SJF

To be updated
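Until the author's version is added, here is a minimal sketch of the usual preemptive form of SJF, commonly called shortest remaining time first (SRTF). It simulates one time unit at a time and always runs the arrived process with the least remaining burst. The names Proc and srtf_completion are hypothetical, and ties go to the process that appears first in the input.

```cpp
#include <cassert>
#include <vector>

struct Proc { int arrival; int burst; };

// Returns the completion time of each process under SRTF.
std::vector<int> srtf_completion(const std::vector<Proc>& procs) {
    const int n = static_cast<int>(procs.size());
    std::vector<int> remaining(n), completion(n, 0);
    for (int i = 0; i < n; i++) {
        assert(procs[i].burst > 0);
        remaining[i] = procs[i].burst;
    }
    int done = 0, t = 0;
    while (done < n) {
        // Pick the arrived, unfinished process with the least remaining time.
        int pick = -1;
        for (int i = 0; i < n; i++) {
            if (procs[i].arrival <= t && remaining[i] > 0 &&
                (pick == -1 || remaining[i] < remaining[pick])) {
                pick = i;
            }
        }
        if (pick == -1) { ++t; continue; }  // nothing has arrived: CPU idles
        --remaining[pick];                   // run the picked process for one unit
        ++t;
        if (remaining[pick] == 0) { completion[pick] = t; ++done; }
    }
    return completion;
}
```

For processes with (arrival, burst) of (0, 8), (1, 4), (2, 9) and (3, 5), the first process is preempted at time 1 and the completion times come out as 17, 5, 26 and 10.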

This Gantt diagram will help you understand the preemptive version of SJF:

Gantt diagram

Round Robin Algorithm
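As mentioned in the introduction, Round Robin gives every process a small fixed time slice and then moves on to the next one. Below is a minimal sketch, assuming all processes arrive at time 0 and every burst time is positive. The function name round_robin_completion is hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <queue>
#include <vector>

// Returns the completion time of each process for a given time quantum.
std::vector<int> round_robin_completion(const std::vector<int>& burst, int quantum) {
    assert(quantum > 0);
    std::vector<int> remaining = burst;
    std::vector<int> completion(burst.size(), 0);
    std::queue<std::size_t> ready;
    for (std::size_t i = 0; i < burst.size(); i++) {
        assert(burst[i] > 0);
        ready.push(i);
    }
    int t = 0;
    while (!ready.empty()) {
        const std::size_t i = ready.front();
        ready.pop();
        const int run = std::min(quantum, remaining[i]);  // run for at most one slice
        t += run;
        remaining[i] -= run;
        if (remaining[i] > 0) {
            ready.push(i);       // not finished: back of the queue
        } else {
            completion[i] = t;   // finished during this slice
        }
    }
    return completion;
}
```

With bursts 24, 3 and 3 and a quantum of 4, the two short processes finish at times 7 and 10 instead of waiting for the long one, which finishes at 30.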

Process synchronization

Processes categorized on the basis of synchronization

The process synchronization problem arises in the case of cooperative processes, because cooperative processes share resources.

Race Condition

A race condition arises when more than one process executes the same code or accesses the same memory or shared variable, so that the output or the value of the shared variable may come out wrong. The processes are, in effect, racing to have their own output count as the correct one. More precisely: when several processes access and manipulate the same data concurrently, the outcome depends on the particular order in which the accesses take place.

Example

Suppose we have two operations, cnt++ and cnt--, from two different processes acting on a global variable cnt.

++ Operation:

int reg=cnt;
reg=reg+1;
cnt=reg;

-- Operation:

int reg2=cnt;
reg2=reg2-1;
cnt=reg2;

Ideally, these operations would run in this order:

int reg=cnt;
reg=reg+1;
cnt=reg;
int reg2=cnt;
reg2=reg2-1;
cnt=reg2;

But as the resource is shared, the steps can run in any order, maybe this one as well:

int reg=cnt;
reg=reg+1;
int reg2=cnt;
cnt=reg;
reg2=reg2-1;
cnt=reg2;

This leaves cnt with a final value of 4 if the initial value is 5, even though one increment followed by one decrement should leave it at 5.
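To make the two orderings concrete, here is a small single-threaded replay of them, treating the first operation as cnt++ and the second as cnt--. It is only a deterministic simulation of the interleaving, not a real concurrent program; the function names are made up for this example.

```cpp
#include <cassert>

// Both operations run one after the other: the net change is zero.
int run_in_order(int cnt) {
    const int start = cnt;
    int reg = cnt;   reg = reg + 1;    cnt = reg;    // cnt++
    int reg2 = cnt;  reg2 = reg2 - 1;  cnt = reg2;   // cnt--
    assert(cnt == start);
    return cnt;
}

// The bad interleaving: the second process reads cnt before the first
// one has written its result back.
int run_interleaved(int cnt) {
    int reg = cnt;      // P1: load
    reg = reg + 1;      // P1: increment in its register
    int reg2 = cnt;     // P2: load (stale value)
    cnt = reg;          // P1: store
    reg2 = reg2 - 1;    // P2: decrement in its register
    cnt = reg2;         // P2: store, overwriting P1's update
    return cnt;
}
```

Starting from 5, run_in_order returns 5 while run_interleaved returns 4: the increment is lost.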

Critical Section

A critical section is a code segment that must be executed by only one process at a time. This code segment is common to many processes, and if several of them run it simultaneously, we would have a hard time finding which process caused an error, if one happens.

Any solution to the critical section must satisfy these rules.

Here is what a critical section looks like

Different Solutions to synchronization problems

1) Disabling Interrupts

We know that, on a single-processor system, a race condition can occur if an interrupt arrives under a preemptive scheduling algorithm. (On a multiprocessor system, it can also occur when two processes on different processors execute the critical section.) A process can simply announce that it is entering a critical section and that the processor should not interrupt it. Well, it works fine, doesn't it? NO, of course not. This can lead to a lot of problems. Firstly, it does not work on multiprocessor systems. Secondly, we cannot give a process the freedom to block interrupts: it could stay inside the critical section indefinitely, disabling interrupts forever.

2) Locks (or Mutex)

Here we have a lock. We acquire the lock, run the critical section, and then release the lock. How is it different from disabling interrupts? Here only the processes that want to enter the critical section wait; all other processes can still interrupt.

There are two types of implementations:

Software: Peterson's solution for two processes and the Bakery Algorithm for multiple processes.
Hardware: Generally used, because it's faster. We have instructions such as test-and-set lock (TSL) and more.

Peterson's Solution

As we saw earlier, we need a solution for the critical section of code, as it can lead to anomalies. This solution should satisfy the rules we mentioned before.

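Simple Pseudo code

int turn;       // which process has to wait if both want to enter
bool flag[2];   // flag[i] is true when process i wants to enter
// Code for process i (j is the other process)
do{
    flag[i]=1;
    turn=j;
    while(flag[j]&&turn==j); // busy wait while j wants in and it is j's turn
    /////////////////////
    // Critical section//
    /////////////////////
    flag[i]=0;
    ///////////////////////
    // Remainder section///
    ///////////////////////

}
while(1);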
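The pseudo code above can be tried out on real threads. The sketch below is one possible translation, assuming sequentially consistent std::atomic variables (plain int and bool are not enough on modern compilers and CPUs, which may reorder the loads and stores). PetersonLock and run_two_workers are hypothetical names.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Peterson's algorithm for exactly two threads with ids 0 and 1.
struct PetersonLock {
    std::atomic<bool> flag[2];
    std::atomic<int> turn;

    PetersonLock() : turn(0) {
        flag[0].store(false);
        flag[1].store(false);
    }

    void lock(int i) {
        assert(i == 0 || i == 1);
        const int j = 1 - i;
        flag[i].store(true);   // I want to enter
        turn.store(j);         // but the other thread may go first
        while (flag[j].load() && turn.load() == j) {
            std::this_thread::yield();  // busy wait
        }
    }

    void unlock(int i) { flag[i].store(false); }
};

// Two threads increment a plain counter inside the critical section.
long long run_two_workers(int iterations) {
    PetersonLock lock;
    long long counter = 0;
    auto worker = [&](int id) {
        for (int k = 0; k < iterations; k++) {
            lock.lock(id);
            ++counter;          // critical section
            lock.unlock(id);
        }
    };
    std::thread a(worker, 0), b(worker, 1);
    a.join();
    b.join();
    return counter;
}
```

Because only one thread is ever inside the critical section, run_two_workers(n) should return exactly 2n, with no lost updates.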

How it works

3) Semaphores

A semaphore is nothing but an integer variable that can be changed only through these two operations:

Wait: It is like a decrement operation.

wait(S){
   while(S<=0);
   S--;
}

Signal: It is like an increment operation.

signal(S){
   S++;
}

Semaphores can be counting or binary(0 or 1).

Binary Semaphores are generally called mutex locks as they provide mutual exclusion.

Counting Semaphores are used to control access to a given resource for multiple processes.

Use of semaphores in handling critical section of N processes

Shared data: semaphore mutex// Initially mutex=1
Process p[i]:
do{
   wait(mutex);
   ////////////////////////
   ////critical section////
   ////////////////////////
   signal(mutex);
   ////////////////////////
   ////remainder section///
   ////////////////////////
}while(1);
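Here is a minimal runnable sketch of the same pattern with real threads. Note that it swaps the busy-waiting wait(S) shown earlier for a blocking version built on std::mutex and std::condition_variable, so a waiting thread sleeps instead of spinning. The Semaphore class and run_workers function are illustrative names, not a standard API.

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>
#include <vector>

class Semaphore {
public:
    explicit Semaphore(int initial) : count_(initial) { assert(initial >= 0); }

    // Block until the value is positive, then decrement it.
    void wait() {
        std::unique_lock<std::mutex> lk(m_);
        cv_.wait(lk, [this] { return count_ > 0; });
        --count_;
    }

    // Increment the value and wake one waiting thread, if any.
    void signal() {
        {
            std::lock_guard<std::mutex> lk(m_);
            ++count_;
        }
        cv_.notify_one();
    }

private:
    std::mutex m_;
    std::condition_variable cv_;
    int count_;
};

// N threads guard a shared counter with a semaphore initialised to 1.
long long run_workers(int n_threads, int iterations) {
    Semaphore mutex(1);
    long long shared = 0;
    std::vector<std::thread> threads;
    for (int t = 0; t < n_threads; t++) {
        threads.emplace_back([&] {
            for (int k = 0; k < iterations; k++) {
                mutex.wait();
                ++shared;        // critical section
                mutex.signal();
            }
        });
    }
    for (std::thread& th : threads) th.join();
    return shared;
}
```

With 4 threads doing 5000 iterations each, the counter should end at exactly 20000.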

How does this work?

Busy Waiting problem's solution

problem with semaphores

How we solve the busy waiting problem

4) Monitors

Deadlocks

Perfect example for deadlock

Okay, a serious example for deadlock

Conditions for a deadlock

Methods for handling deadlocks

1) Deadlock Prevention or Avoidance: We have to prevent at least one of the four conditions mentioned above from holding.

Let's see which one we can avoid.

2) Deadlock detection and recovery

3) Ignore Deadlock and Reboot the system

Example of this prevention

Problems where you are given some processes along with resources, and have to find an order of execution or decide which process to remove to avoid deadlock, can be solved with these deadlock resolving techniques. We will be discussing some problems as well.

Banker's algorithm for avoiding Deadlocks

See the table below. Initially, we had total resources of type A=10, B=5, C=7. (Think of them as A printers, B keyboards, C CPUs.)

(Table image: Allocation, Max Need, Available and Remaining Need for each process)

Here is the terminology used:

Allocation: Resources already allocated to each process by the system.
Max Need: The maximum resources a process may need. (We saw in deadlock avoidance that pre-computing how many resources a process needs is generally not possible, but here we assume we somehow know the maximum need.)
Available: Resources available at a particular time. Initially we had 10 instances of resource A, but by the time we look at the system, it has already allocated some of them, leaving us with the resources listed.
Remaining Need: Some resources are already allocated, so the remaining need is just Max Need - Allocation.

There can be two tasks here: detection and finding a sequence of processes. We have to output a sequence of processes in which deadlock does not happen, or say that it's impossible.

Now, it's very simple from here. We have the remaining need for every process. We just iterate over the processes sequentially and see which one we can execute with the available resources. If a process is executable, we execute it to completion and get back the resources that were allocated to it. If we reach a situation where we can't execute any of the remaining processes, we welcome the deadlock.
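The iteration described above can be sketched as follows. The function name safe_sequence and its parameter layout are assumptions made for this example; it returns the order in which processes can finish, or an empty vector if we get stuck.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Vec = std::vector<int>;

// Repeatedly scans the processes and runs any whose remaining need
// (max_need - allocation) fits in what is currently available.
std::vector<int> safe_sequence(Vec available,
                               const std::vector<Vec>& allocation,
                               const std::vector<Vec>& max_need) {
    const std::size_t n = allocation.size();
    assert(max_need.size() == n);
    std::vector<bool> finished(n, false);
    std::vector<int> order;
    bool progress = true;
    while (order.size() < n && progress) {
        progress = false;
        for (std::size_t p = 0; p < n; p++) {
            if (finished[p]) continue;
            bool can_run = true;
            for (std::size_t r = 0; r < available.size(); r++) {
                if (max_need[p][r] - allocation[p][r] > available[r]) {
                    can_run = false;
                    break;
                }
            }
            if (!can_run) continue;
            // The process runs to completion and returns what it held.
            for (std::size_t r = 0; r < available.size(); r++) {
                available[r] += allocation[p][r];
            }
            finished[p] = true;
            order.push_back(static_cast<int>(p));
            progress = true;
        }
    }
    if (order.size() < n) order.clear();  // stuck: no safe sequence
    return order;
}
```

As a usage example, take the classic textbook instance with totals A=10, B=5, C=7 (these numbers are an assumption here, since the table image is not reproduced in the text): Allocation = {0,1,0}, {2,0,0}, {3,0,2}, {2,1,1}, {0,0,2}, Max Need = {7,5,3}, {3,2,2}, {9,0,2}, {2,2,2}, {4,3,3} and Available = {3,3,2}. The function then returns the order P1, P3, P4, P0, P2 (0-indexed).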

I will be making a Codeforces problem on it as well. You can visualize this as a completely greedy algorithm. As for practical application, as we saw previously, we usually cannot know the max need of processes: we don't know which process wants which resource and for how long, so the algorithm is not that practical.

Threads

We saw in the types of OS (in the introduction at the start of the blog) that we have multithreading as well. We have multiple threads running in an interleaved fashion inside a process. The foremost advantage is increased responsiveness. A thread is the smallest unit that can be assigned to the CPU.

Interesting thing about threads

Advantages and Disadvantages of threads

Types of Threads

User Level Thread: A process creates multiple threads and the kernel is not at all aware of them. We don't need to call the OS and no system calls are involved, which makes context switching much faster. There are both advantages and disadvantages. We get fast context switching and very fast creation and termination because the kernel is not involved. But if even a single thread makes a blocking call, all the other threads in the process have to wait for that call to complete. This is because the call goes to the kernel, which sees the whole process as a single thread and will not run another thread of the same process in the meantime.
Kernel Level Threads: The kernel knows about and manages the threads. The OS kernel provides system calls to create and manage threads. Kernel-level threads require the involvement of the kernel, which means you have to do a mode switch (Google it): you go from user space to kernel space and then do the switch. The kernel then has to schedule some other thread as well.

Difference between the two

Feature	User Level Thread	Kernel Level Threads
Management	In User space	In kernel space
Context Switching	Fast	Slow
Blocking	One thread might block all other threads	A thread blocks only itself
Multicore or Multiprocessor	Cannot take advantage of a multicore system; only concurrent execution on a single processor	Takes full advantage of a multicore system
Creation/Termination	Fast	Slow

Mapping of User threads to Kernel threads

One to One: The most common way, used in common operating systems like Windows. Every user thread is mapped to one kernel thread. Very similar to a pure kernel thread. It is slower than many to one, but it can use multicore systems efficiently.

Many to One: Many user threads are mapped to one kernel thread. They can be seen as pure user threads. They provide fast context switching along with fast creation and termination, but they cannot utilize multicore systems, and all the threads can be blocked by one thread making a blocking system call.
Many to Many: Very rarely used. Many user threads are mapped to many kernel threads.

Memory Management

Comments (14)

sparkles

4 years ago, # |

Thanks for this! However i think if these all can be provided in pdf form /similar it would be of much more help.

abdude824

4 years ago, # ^ |

Actually I love the spoiler function of codesforces(

This one

)

This enables efficient reading and I love to revise with it afterwards.

approach

thanks, next DBMS please.

-is-this-fft-

+132

Would also like to read some notes on the history of the Seleucid Empire.

last.winged.hussar

Yes, please can you share some? Also of the Greek, Macedonian and the Achamenid empires, if you have. Thank you.

+16

If I could share some I wouldn't ask ;)

Oh never mind, I read it as "Would you like to.."

SharpC

World history (ed. Zhukov) in 10 volumes, volume 2, page 247

Blinding_Lights

Got the real-world scenario of deadlock from Perfect example of deadlock. lol

AbhilashaBansal

kamaal

Snapper_001

21 month(s) ago, # |

we can make semaphore like
struct sem{
    int count;
    queue q;  //for wait the process to avoid busy waiting
};

lis05

+13

Btw I use Arch btw

ko_osaga

Please fix my Pintos code, I have 12 failed tests, and the due is next Monday

FAIL tests/vm/page-merge-par
FAIL tests/vm/page-merge-stk
FAIL tests/vm/page-merge-mm
FAIL tests/vm/mmap-write
FAIL tests/vm/mmap-exit
FAIL tests/vm/mmap-shuffle
FAIL tests/vm/mmap-inherit
FAIL tests/vm/mmap-off
FAIL tests/vm/swap-file
FAIL tests/vm/swap-anon
FAIL tests/vm/swap-iter
FAIL tests/vm/cow/cow-simple

ye_kaun_hai

2 months ago, # |

Where is the next part of this?? This is not completed yet

abdude824's blog