searching a vector we can search for one occurrence, return true/false or the index of occurrence...

Searching a vector We can search for one occurrence, return true/false or the index

of occurrence Search the vector starting from the beginning Stop searching when match is found

We can search and count the number of occurrences and return count Search entire vector Similar to one occurrence search, but do not stop after first

occurrence

We can search for many occurrences, but return occurrences in another vector rather than count

In all these cases, we search the vector sequentially starting from the beginning This type of search is called “sequential search”

Counting searchint countmatches(const vector<string> & a, const string& s)// post: returns # occurrences of s in a{ int count = 0; int k; for(k=0; k < a.size(); k++) { if (a[k] == s) { count++; } } return count;}

How can we change this code to return the index of the first occurrence? see next slide

One occurrence searchint firstmatch(const vector<string> & a, const string& s)// post: returns the index of occurrence of s in a, -1 // otherwise{ int k; for(k=0; k < a.size(); k++) { if (a[k] == s) { return k; } } return -1;}

Does not search the entire array if one match is found good for efficiency purposes

How could you modify this to return true/false?

Collecting searchCollect the occurrences in another vector

void collect(const vector<string> & a, vector<string> & matches)// pre: matches is empty// post: matches contains all elements of a with// first letter 'A'{ int k; for (k=0; k < a.size(); k++) {

if (a[k].substr(0,1) == "A") {

matches.push_back(a[k]); } }}

Binary searchAlternative to sequential search for sorted vectorsIf a vector is sorted we can use the sorted property to

eliminate half of the vector elements with one comparisonWhat number (between 1 and 100) do we guess first in

number guessing game?

Idea of creating program to do binary searchCheck the middle element

If it has the searched value, then you’re done! If not, eliminate half of the elements of the vector

search the rest using the same ideacontinue until match is found or there is no match

how could you understand that there is no match?let’s develop the algorithm on an example

we need two index values, low and high, for the search space

Binary Search (search for 62)

10 24 34 52 55 62 67 75 80 81 90 92 100 101 111

low=0 mid=7 high=14

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

low=0 mid=3 high=6

low=4 high=6

mid=5 => FOUND

Binary Search (search for 60)

10 24 34 52 55 62 67 75 80 81 90 92 100 101 111

low = 0 mid=7 high =14

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

low=0 mid=3 high=6

low=4 high=6mid=5

low=4 high=4mid=4

low=5 high=4 => NO MATCH FOUND – STOP

Binary search codeint bsearch(const vector<string>& list, const string& key)// pre: list.size() == # elements in list// post: returns index of key in list, -1 if key not found{ int low = 0; // leftmost possible entry int high = list.size()-1; // rightmost possible entry int mid; // middle of current range

while (low <= high) { mid = (low + high)/2; if (list[mid] == key) // found key, exit search { return mid; } else if (list[mid] < key) // key in upper half { low = mid + 1; } else // key in lower half { high = mid - 1; } } return -1; // not in list}

Comparing Sequential and Binary Search

• Given a list of N elements:• Binary search makes on the order of log N operation

O(log N)• Linear (sequential) search takes on the order of N operations

O(N)

SortingOne of the fundamental operations in Computer

ScienceGiven a randomly ordered array, sort it

ascendingdescending

Many algorithms existssome in Chapter 11we will discuss two of them – Selection Sort (11.1.1)

and Insertion Sort (11.1.2)Analysis in 11.4

Selection SortN is the number of elements in vector/arrayFind smallest element, move into 0th vector/array location

examine all N elements 0 .. N-1

Find next smallest element, move into 1st location0th location is already the minimumexamine N-1 elements

1 .. N-1

Find next smallest element, move into 2nd location0th and 1st locations are already the minimum two elementsexamine N-2 elements

2 .. N-1

Generalizefor kth element, 0 <= k <= N-2 - find the minimum between kth and last element (element

with index N-1) of array - swap the kth element with the minimum one

Selection Sort: The Code

void SelectSort(vector<int> & a)// pre: a contains a.size() elements// post: elements of a are sorted in non-decreasing order{ int j, k, temp, minIndex, numElts = a.size(); for(k=0; k < numElts - 1; k++) { minIndex = k; // minimal element index for(j=k+1; j < numElts; j++) { if (a[j] < a[minIndex]) { minIndex = j; // new min, store index } } temp = a[k]; // swap min and k-th elements a[k] = a[minIndex]; a[minIndex] = temp; }}

insertion and deletionIt’s easy to insert at the end of a vector, use push_back()However, if the vector is sorted and if we want to keep it sorted,

then we can’t just add to the end We have to find an appropriate position to insert the element and do

some shifts.

If we need to delete an element from a sorted vector, how can we “close-up” the hole created by the deletion?Shift elements left by one index, decrease sizeWe decrease size using pop_back()

pop_back() changes size, not capacity

Inserting an element into a sorted array

2 7 11 18 21 22 26 89 99

Insert NewNum which is e.g. 23

• Is the array “capacity” sufficient for an extra element?• What would you do to insert newNum in the right spot?

Insertion into sorted array

2 7 11 18 21 22 26 89 99

NewNum = 23

2 7 11 18 21 22 26 26 89 99

0 1 2 3 4 5 6 7 8

0 1 2 3 4 5 6 7 8 9

Insert into sorted vector

void insert(vector<int>& a, int newnum) // NOT const vector// pre: a[0] <= … <= a[a.size()-1], a is sorted// post: newnum inserted into a, a still sorted{ int count = a.size(); //size before insertion a.push_back(newnum); //increase size – newnum is inserted at // the end but the inserted value is not important int loc = count; // start searching insertion loc from end while (loc > 0 && a[loc-1] > newnum) { a[loc] = a[loc-1]; loc--; // shift right until the proper insertion cell } a[loc] = newnum; //actual insertion }

See vectorproc.cpp (not in book)

What about deletion?Remove the element at a given position (pos)

void remove(vector<string>& a, int pos)// post: original a[pos] removed, size decreased{ int lastIndex = a.size()-1; a[pos] = a[lastIndex]; a.pop_back();}

What about if vector is sorted, what changes?What’s the purpose of the pop_back() call?

Deletion from sorted vector

2 7 11 18 21 22 26 89 99

0 1 2 3 4 5 6 7 8

Ex: Delete element at position 3

2 7 11 21 22 26 89 99 99

0 1 2 3 4 5 6 7 8

2 7 11 21 22 26 89 99 99

0 1 2 3 4 5 6 7

Size is 9

Size is now 8

First shift all elements on the right of 3rd element one cell to the left

pop back the last element of vector

Deletion from sorted vectorvoid remove(vector<int> & a, int pos)// pre: a is sorted// post: original a[pos] removed, a is still sorted{ int lastIndex = a.size()-1; int k; for(k=pos; k < lastIndex; k++) { a[k] = a[k+1]; } //shift all elements on the right of pos one cell left a.pop_back(); //remove the last element of the array}

Does pop_back() actually remove an element?no, it just decreases the size so that the last element

becomes unreachableCapacity remains the same

See vectorproc.cpp (not in book)

Insertion Sort

Insert 1st element before or after 0th

first 2 sortedInsert 2nd element (element with index 2) in

proper locationfirst 3 sorted

Generalizeinsert kth element (element with index k) within first

k elementsfirst k+1 sorted

run this for all k between 1 .. N-1

Insertion Sort – The Codevoid InsertSort(vector<string> & a)// precondition: a contains a.size() elements// postcondition: elements of a are sorted in non-decreasing order

{ int k,loc, numElts = a.size(); for(k=1; k < numElts; k++) { string hold = a[k]; // insert this element loc = k; // location for insertion

// shift elements to make room for hold (i.e. a[k]) while (0 < loc && hold < a[loc-1]) { a[loc] = a[loc-1]; loc--; } a[loc] = hold; }}

Which one faster?No exact answer! It depends on the vector to be sorted

already ordered, totally disordered, randomLet’s see how many iterations do we have in Selection Sort

Outer loop k: 0 .. N-2 Inner loop j: k+1 .. N-1

(N-1) + (N-2) + (N-3) + .. + 1 = N(N-1)/2 = (N2 – N)/2Worst case, best case, average case???Complexity is O(N2)

order of N2

Big-oh notation used to describe algorithmic complexities. This is not a precise amount of operations and comparisons. Minor terms and coefficients are not taken into consideration

Which one faster?Let’s analyze Insertion Sort

Outer loop k: 1 .. N-1 N-1 iterations for the outer loop

What about inner loop? worst case, best case differ worst case: k times, so total is 1+2+3+…+(N-1) = N(N-1)/2, complexity is

O(N2) best case: inner loop does not iterate, complexity is O(N), but best case

complexity analysis is not done too often what are the best and worst cases? average case: inner loop iterates k/2 times, order is still O(N2)

Complexities of both Selection and Insertion Sort are O(N2)Which one would you prefer to use?Let’s run timesorts.cpp (modified from book) – needs several

Tapestry .h and .cpp files in the project folder to run (comparer.h, ctimer.h, ctimer.cpp, prompt.h, prompt.cpp, randgen.h, randgen.cpp, sortall.h, sortall.cpp – red ones to be added to the project).

Built-in ArraysC++ native array typeTwo versions

fixed size arraysarray size is fixed and must be specified with a constant

expression at the declarationwe will see this type now

array pointersarray size is dynamically allocatedwe will not see it in this course

use of both types are the same except definitionvector versus built-in arrays

vector is a class based on built-in arraysvector has member functions and operators, built-in

arrays do NOTvector is more flexible, but slower

Built-in Array declaration As we said, we will discuss fixed size built-in arrays, not the pointer

version with dynamic allocation size must be able to be determined at compile time

constant, literal or an expression that involves constants and literals only

const int CLASSSIZE = 100; // constant

string names[CLASSIZE]; // array of 100 stringsdouble grades[CLASSIZE*5]; // array of 500 doublesint list[200]; // array of 200 integers

The following array declaration is INVALID

int size;cout "Enter how many students ? ";cin >> size;string names[size]; // array size cannot be a variable

Built-in array initialization at declaration

You may specify a list of initial values at declaration. See following example:

string dayNames [] = {"Sunday", "Monday", "Tuesday", "Wednesday", "Thursday","Friday",

"Saturday"};

dayNames is an array with 7 elements of type string 0th element is “Sunday”, 1st is “Monday”, ...

not necessary to specify size (7), since the number of elements make the size clear but you can specify the size, if you wish

string dayNames [7] = {"Sunday", "Monday", "Tuesday", "Wednesday",

"Thursday","Friday", "Saturday"};

Assignment rules in arraysvectors with the same element type can be assigned to

each other by =LHS vector becomes the same as the RHS vector

size and capacity also become the same

Built-in arrays cannot be assigned to each other by =

int coins[] ={1,5,10,25};int temp[4];

temp = coins; // illegaltemp[1] = coins[2]; // legal – array element assignment

How can we assign coins to temp?element by element

for (i=0; i<4; i++) temp[i] = coins[i];

Passing built-in arrays as parametersA built-in array can be passed only as reference

parameter or const-reference parametercannot be passed as value parameter

But, we do not use ampersand character, &, at parameter declarationand we do not specify the array size in array parameterhowever array size is generally passed as another

integer parameter since we do not have a size() member function for built-in arrays

void Change(int list[], int numElts);

void Print(const int list[], int numElts);

reference parameter

const-reference parameter

Built-in array demoSee fixlist.cpp (slightly modified from the version in

book)Why did we use const in Print?

to avoid accidental changes in array list

Why did we pass numElts as parameter for the number of elements in the array? because we don’t know the total number of elements in the

array while writing the functions

Example – Fibonacci numbersUsed in many areas of Mathematics and Computer Science

F0 = 1F1 = 1Fn = Fn-1 + Fn-2

1 1 2 3 5 8 13 21 34 55 89 144 233 377 610 987You can see many examples of Fibonacci numbers in nature

E.g. Increase of number of branches of trees in time See http://www.mcs.surrey.ac.uk/Personal/R.Knott/Fibonacci/fibnat.html

for more examples

const int MAX_SIZE = 100; int list[MAX_SIZE];

int k; list[0] = list[1] = 1; for (k=2; k < MAX_SIZE, k++) {

list[k] = list[k-1]+list[k-2];}

Use of strings as arraysCharacters in a string can be referred as an array using [ ]

string s="cs201";...s[0] = 'n'; // makes 0th character of s 'n'... for (k=0; k<s.length(); k++) cout << s[k] << " ";

In general,s[k] means s.at(k)

The MatrixTo represent two dimensional arrays

Given rows and columns:We define a matrix as a vector of vectors

vector<vector<int>> mat(rows, vector<int>(cols));vector<vector<int>> mat(3, vector<int>(5));

mymatrix[2][3] = 100;

First index is for row, second is for column

100

0 1 2 3 40

1

2

Possible Matrix definitionsPossible matrix declarations

4 different declarations

vector<vector<int>> matrix_name; empty matrix (zero rows, zero columns)

vector<vector<int>> matrix_name(rows); matrix with rows many empty vector<int>’s

vector<vector<int>> matrix_name(rows, vector<int>(cols)); matrix with rows*cols elements all initialized to 0

vector<vector<int>> matrix_name(rows, vector<int>(cols,

init_value)); matrix with rows*cols elements all initialized to init_value

To get the size of rows and columnsmymatrix.size()

number of rows in matrix

mymatrix[0].size()number of columns in matrix

Example: Let’s run matdemo.cpp

searching a vector we can search for one occurrence, return true/false or the index of occurrence...

Documents