COMP 3400 Lecture 3: Processes
[ previous
| schedule
| next ]
What is a process?
- a program in execution
- major components of a process include
- its program code (text)
- its global variables (data)
- its temporary data (stack)
- its dynamically-allocated data (heap)
- its program counter and register values
- multiple processes exist simultaneously. In Windows do Ctrl-Alt-Del then
click Task Manager then Processes; in Linux type ps -e (or ps -el for long format)
- multiple processes can be associated with the same program
- each process operates in a separate space from other processes
Each process has identity
Every process is given an identifier when created, typically an integer value known as
its Process I.D. (PID).
Each process has state
- Recall in OOP that each object has a state, which is the collection
of its fields' values
- Processes have state too; it changes over time
- new, initial state while being created
- ready, "eligible for execution"
- running, using the CPU
- waiting, waiting for an event (not running, not ready)
- terminated, finished with CPU but still exists
- process is represented by data structure called process control block, which contains info such as:
- process state
- saved program counter
- saved register values
- list of files and devices in use
- memory management info
- process management info
- thread info
- OS maintains all necessary info about the process in its PCB.
Scheduling and running Processes
- concurrent processing: multiple processes exist at the same time
but only one may be in running state - single CPU
- parallel (simultaneous) processing: multiple processes are running at
the same time - multiple CPUs
- scheduling is important policy whether concurrent or parallel processing
- PCBs spend their existence moving from queue to queue.
- "ready" queue if in ready state
- "device" queue if waiting for I/O
- etc
- process schedulers are kernel code to maintain queues
- a long-term scheduler runs occasionally to keep right mix of processes in memory
- a short-term CPU scheduler runs frequently to keep right mix of processes on CPU
- the corresponding mechanism is context switching
- context switching involves switching "current process" pointer from one PCB to another
and saving/loading register values
Process creation
- A process can be created only by another running process
- The creator is the parent process, the created is its child process
- A parent can create many children; a child has but one parent (whose PID is stored in its PCB)
- child may be constrained to using its parent's resources, may not
- parent can pass initialization data and possibly resources to child upon creation
- parent execute concurrently with its children and/or await child termination
- child process may be created as clone of parent
- details on Unix process creation:
- parent creates child by calling fork system call
- child is clone of parent (duplicate process space and most of PCB)
- by default both continue from same program counter
- in reality, child is given different code to execute:
- return value of fork is 0 for child, child's PID for parent. This is
used as condition of if-then-else
- one or the other can then overwrite its own code space with executable code
from a file by calling exec system call
- the newly-loaded code subsequently runs under the same PID
- the term exec above refers to a group of related system calls
- to see Linux process list, do command: ps -e (or ps -el for long format)
Process termination
- Unix process terminates by calling exit system call. argument
value passed to parent if waiting
- it may also be terminated externally (by parent or kernel)
- Parent is notified when child terminates, and parent resumes at statement following wait
(if it was waiting)
- Some OSs do not allow parent to terminate until all children have; Unix
allows this but requires child to be adopted by special kernel init
process (PID 1).
Simple non-robust Unix example
#include <stdio.h>
#include <unistd.h>
int main() {
if (fork()) {
printf("I am parent, my PID is %d.\n",getpid());
wait();
printf("My child is terminated, I am leaving too.\n");
} else {
printf("I am child, my PID is %d.\n",getpid());
}
exit(0);
}
Sample output:
I am parent, my PID is 1265.
I am child, my PID is 1266.
My child is terminated, I am leaving too.
Concurrent Cooperating processes
With concurrency, multiple processes exist at the same time. If they communicate with each
other or depend on each other, they are called cooperating processes.
Such communication of data or information is called interprocess communication (IPC). IPC
mechanisms fall into two categories: shared memory and message passing
The classic example of cooperating processes is the producer-consumer
problem.
This is nicely illustrated by the classic I Love Lucy chocolate factory scene.
- One process, the producer, produces information.
- Another process, the consumer, consumes that information.
- They do not run in lockstep because this is too
much a constraint.
- They share a limited-size buffer to hold the information.
- The only producer constraint: if buffer is full, has to wait until a slot free
- The only consumer constraint: if buffer is empty, has to wait until a slot fills
The producer and consumer cooperate through the shared buffer. There are variations on the problem
for unbounded (infinite) buffer, bounded (finite) buffer, and zero capacity buffer.
Message-passing IPC facilities
are simple in concept: send(toID, message) and receive(fromID,
message) are the only calls required. They are varied and possibly complex in
implementation.
Processes communicating via IPC may communicate directly, using their
partner's process IDs as the toID and fromID. Alternatively,
they may communicate indirectly, using a port or mailbox
as the toID and fromID. In this case, both are the same ID
and it must be accessible to both parties. Ports and mailboxes are akin to buffers
because they are an intermediary. Communication is not limited to two parties:
anyone who has access to the same shared ID can participate. Kind of like IM
or chat.
The send and receive operations can be implemented differently
to achieve different degrees of synchronization between sender and
receiver.
- send can be implemented using blocking,
which means the sender is blocked until the message is received by the receiver
or mailbox.
- receive can be implemented using blocking, which means
the receiver blocks until a message is sent.
- If there is no buffering between them, they are strictly synchronized.
- If there is an "infinite" buffer, the sender will never block but the receiver
still blocks on empty buffer.
- If you desire both sender and receiver to be non-blocking, then you must
provide an infinite buffer and have receiver return a null if buffer is empty.
Unix support for producer/consumer: pipe
The Unix pipe() system call creates a bounded buffer (size is system-dependent)
through which two processes may interact, one as producer and the other as consumer.
The producer will block if the pipe is full and the consumer will block if the
pipe is empty.
To make the pipe shareable, it must be created before a fork(), then
after the fork the parent and child processes both have access to the pipe and
communicate by reading/writing to it.
Here is an example showing the setup of a pipe then after the fork the parent
sends the child some characters through the pipe and the child reads and prints
them. The pipe-related code is shown in boldface.
#include <stdio.h>
#include <sys/types.h>
#include <unistd.h>
#define BUFFER_CAPACITY 6
char *packets[] = { "Hello there" , " from me." };
int main() {
int channels[2], readChannel, writeChannel, pipeCreateFailed;
pid_t processID;
pipeCreateFailed = pipe(channels);
if (pipeCreateFailed) {
printf("pipe failed.\n");
exit(-1);
} else {
readChannel = channels[0];
writeChannel = channels[1];
}
processID = fork();
if (processID < 0) { /* fork failure */
printf("fork failed.\n");
exit(-1);
} else if (processID > 0) { /* parent code */
int numPackets, i;
close(readChannel);
numPackets = sizeof(packets)/sizeof(char *);
for (i=0; i<numPackets; i++) {
write(writeChannel, packets[i], strlen(packets[i]));
}
close(writeChannel);
wait();
printf("parent terminates\n");
} else { /* child code */
char buffer[BUFFER_CAPACITY+1];
int bytes;
close(writeChannel);
while ( (bytes = read(readChannel, buffer, BUFFER_CAPACITY)) != 0 ) {
buffer[bytes] = '\0';
printf("%d chars :%s: received by child\n", bytes, buffer);
}
close(readChannel);
printf("child terminates\n");
}
exit(0);
}
This code requires a little explanation:
- the parameter to pipe() is an array of two int.
- pipe() creates a file descriptor ("channel") for reading into element 0.
- pipe() creates a file descriptor ("channel") for writing into element 1.
- the process doing the writing (the parent in this case) must first close its read
file descriptor.
- the process doing the reading (the child in this case) must first close its write
file descriptor.
- writing to the pipe is done here using the write() system call
(higher level functions can also be used).
- its first parameter is the file descriptor for writing
- its second parameter is the address of the data to be written
- its third parameter is the number of bytes to write
- its return value is the number of bytes written
- reading from the pipe is done here using the read() system call
(higher level functions can also be used).
- its first parameter is the file descriptor for reading
- its second parameter is the address of the buffer to hold the data read
- its third parameter is the number of bytes to read
- its return value is the number of bytes actually read (if 0, writer has closed its pipe!)
Internet Client-Server communication: sockets
Sockets are a low-level IPC (InterProcess Communication) mechanism through which client and server processes
may communicate with each other across the Internet. Both may reside on the same machine or
on different machines.
This is an asymmetric arrangement. Before communication can occur, the server process
must be running and listening for client requests on a specific port. In order to make
a request, the client must first know the IP address and port number associated with
the server process. It makes the request, rendezvous occurs and communication proceeds according
to the protocol defined by their respective programs. The server continues to run after finished
with the client.
Here is a simple sockets-based application for an upper-case server. Client sends a string and
the server converts the string to all upper case then sends the result back. The example code
is shown in Java because it is somewhat cleaner than the equivalent C code. All input and
output is done through stream objects that are attached to the socket.
The Client
import java.io.*;
import java.net.*;
class TCPClient {
static final String SERVER = "PeteSanderson.otterbein.edu";
static final int PORT = 6789;
public static void main(String argv[]) throws Exception {
String sentence;
String modifiedSentence;
BufferedReader inFromUser =
new BufferedReader(new InputStreamReader(System.in));
Socket clientSocket = new Socket(SERVER, PORT);
DataOutputStream outToServer =
new DataOutputStream(clientSocket.getOutputStream());
BufferedReader inFromServer =
new BufferedReader(new
InputStreamReader(clientSocket.getInputStream()));
sentence = inFromUser.readLine();
outToServer.writeBytes(sentence + '\n');
modifiedSentence = inFromServer.readLine();
System.out.println("FROM SERVER: " + modifiedSentence);
clientSocket.close();
}
}
The Server
import java.io.*;
import java.net.*;
class TCPServer {
static final int PORT = 6789;
public static void main(String argv[]) throws Exception {
String clientSentence;
String capitalizedSentence;
ServerSocket welcomeSocket = new ServerSocket(PORT);
while(true) {
Socket connectionSocket = welcomeSocket.accept();
BufferedReader inFromClient =
new BufferedReader(new
InputStreamReader(connectionSocket.getInputStream()));
DataOutputStream outToClient =
new DataOutputStream(connectionSocket.getOutputStream());
clientSentence = inFromClient.readLine();
capitalizedSentence = clientSentence.toUpperCase() + '\n';
outToClient.writeBytes(capitalizedSentence);
}
}
}
[ COMP 3400
| Peter Sanderson
| Math Sciences home page
| Otterbein
]
Last updated:
Peter Sanderson (PSanderson@otterbein.edu)