Home >  T.A >  CS179G ( Spring 2003 ) >  Projects >  Project Phase #1

  Project Phase #1 - Buffer Manager
due date: Thursday 24 April 2003

Introduction Notes   ::   Lab 1 Notes   ::   Minibase Tutorial   ::   phase1.tar.gz   ::   Browse Files
Love/Hate Venn Diagram   ::  

Introduction

In this phase, you will have to implement a simple buffer management layer (without support for concurrency control and recovery) for the Minibase database system. The code for the underlying Disk Space Manager will be given. HTML documentation for Minibase is available on the web (http://www.cs.wisc.edu/coral/minibase/project.html). In particular, you should read the description of the DB class, which you will use extensively in this assignment.

Note: The Minibase buffer manager layer differs from what you have to implement in that it contains methods to support concurrency control and recovery.

Preliminary Work

  1. Download and study the Introductionary Notes in Database Architecture which will help you remember what is each component responsible for.Download here

  2. Download and study the Minibase Minibase Short Tutorial which will help you to understand how error handling and the Storage Manager work in Minibase.Download here

  3. Read the description of the Buffer Manager from the book (Textbook section 7.4 2nd edition OR 9.4 in 3rd edition.

Read the description in the text and follow this page organization.

Compiling Your Code and Running the Tests

Please Download the the files for Phase 1 from here phase1.tar.gz into your working directory.

  1. mkdir project
  2. cd project
  3. tar -zxvf phase1.tar.gz
  4. cd phase1-bm/
Now you will see 3 generated directories:
  1. lib/
  2. include/
  3. src/
src/ contains the files you will be working on. If you cd src/ and then you will see the files you will be working on. Please edit the Makefile and change the MINIBASE variable to
MINIBASE=/class/csud/cs179g/cs179gXX/project/phase1-bm,
where cs179gXX is your personal account. Before you make the project you need to make depend which will append several lines at the end of the Makefile. Now you can make the project, which will create an executable named buftest.
Right now, it does not work; you will need to fill in the bodies of the BufMgr class methods + others. The methods are defined (empty) in file buf.C.

Sample output of a correct implementation is available in sample_output.

The Buffer Manager Interface
 The simplified buffer manager interface that you will implement allows a higher level program to allocate and deallocate pages on disk, to bring a disk page to the buffer pool and pin it, and to unpin a page in the buffer pool.

The methods that you have to implement are described below:
class BufMgr {

public:
// This is made public just because we need it in your
// driver test.C . It could be private for real use.

Page* bufPool;
// The physical buffer pool of pages.

public:

BufMgr(int numbuf, Replacer *replacer = 0);
// Allocate "numbuf" pages (frames) for the pool in main memory.

~BufMgr();
// Should flush all dirty pages in the pool to
// disk before shutting down and deallocate the
// buffer pool in main memory.

Status pinPage(PageId PageId_in_a_DB, Page*& page,int emptyPage=0);
// Check if this page is in buffer pool. If it is, increment the pin_count
// and return a pointer to this page. If the pin_count was 0 before the
// call, the page was a replacement candidate, but is no longer a candidate
// If the page is not in the pool, choose a frame (from the set of replacement
// candidates) to hold this page, read the page (using
// the appropriate DB class method) and pin it.
// Also, must write out the old page in chosen frame if it is dirty
// before reading new page. (You can assume that emptyPage == 0 for
// this assignment.)

Status unpinPage(PageId globalPageId_in_a_DB, int dirty, int hate);
// hate should be TRUE if the page is ?hated? and FALSE otherwise.
// Should be called with dirty==TRUE if the client has
// modified the page. If so, this call should set the dirty bit
// for this frame. Further, if pin_count>0 should decrement it.
// If pin_count=0 before this call, return error.

Status newPage(PageId& firstPageId, Page*& firstpage,int howmany=1);
// Find a frame in the buffer pool for the first page
// If a frame exists, call DB object to allocate a run of new pages and
// and pin it. (This call allows a client of the Buffer Manager
// to allocate pages on disk.) If buffer is full, i.e., you
// can't find a frame for the first page, return error.

Status freePage(PageId globalPageId);
// This method should be called to delete a page that is on disk.
// This routine must call the DB class to deallocate the page.

Status flushPage(int pageid);
// Used to flush a particular page of the buffer pool to disk
// Should call the write_page method of the DB class

Status flushAllPages();
// Flush all pages of the buffer pool to disk
};


Design Overview and Implementation Details

The buffer pool is a collection of frames (page-sized sequence of main memory bytes) that is managed by the Buffer Manager. It should be stored as an array bufPool[numbuf] of Page objects. In addition, you should maintain an array bufDescr[numbuf] of descriptors, one per frame. Each descriptor is a record with the following fields:

page number, pin_count, dirtybit

The pin_count field is an integer, page number is a PageId object, and dirtybit is a boolean. This describes the page that is stored in the corresponding frame. A page is identified by a page number that is generated by the DB class when the page is allocated, and is unique over all pages in the database. The PageId type is defined as an integer type in minirel.h.

A simple hash table should be used to figure out what frame a given disk page occupies.The hash table should be implemented (entirely in main memory) by using an array of pointers to lists of <page number, frame number> pairs. The array is called the directory and each list of pairs is called a bucket. Given a page number, you should apply a hash function to find the directory entry pointing to the bucket that contains the frame number for this page, if the page is in the buffer pool. If you search the bucket and don't find a pair containing this page number, the page is not in the pool. If you find such a pair, it will tell you the frame in which the page resides. This is illustrated in Figure 1:
 
 

The hash function must distribute values in the domain of the search field uniformly over the collection of buckets. If we have HTSIZE buckets, numbered 0 through M-1, a hash function h of the form

works well in practice. HTSIZE should be chosen to be a prime number. When a page is requested the buffer manager should do the following: Check the buffer pool (by using the hash table) to see if it contains the requested page. If the page is not in the pool, it should be brought in as follows:

  • Choose a frame for replacement, using the LOVE/HATE replacement policy.
  • If the frame chosen for replacement is dirty,  flush it (i.e., write out the page that it contains to disk, using the appropriate DB class method).
  • Read the requested page (again, by calling the DB class) into the frame chosen for replacement; the pin_count and dirtybit for the frame should be initialized to 0 and FALSE, respectively.
  • Delete the entry for the old page from the Buffer Manager's hash table and insert an entry for the new page. Also, update the entry for this frame in the bufDescr array to reflect these changes.
  • Pin the requested page by incrementing the pin_count in the descriptor for this frame and return a pointer to the page to the requester.
The Love/Hate replacement policy

Theoretically, the best candidate page for replacement is the page that will be last requested in the future. Since implementing such policy requires a future predicting oracle, all buffer replacement policies try to approximate it one way or another. The LRU policy, for example, uses the past access pattern as an indication for the future. However, sequential flooding can ruin this scheme and MRU becomes more appropriate in this particular situation. In this assignment you are supposed to implement the love/hate replacement policy. The policy tries to enhance prediction of the future by relying on a hint from the upper levels about the page. The upper level user hints the buffer manager that the page is loved if it is more likely that the page will be needed in the future, and hated if it is more likely that the page will not be needed. The policy is supposed to maintain an MRU list for the hated pages and an LRU list for the loved pages. If a page is needed for replacement, the buffer manager selects from the list of hated pages first and then from the loved pages if no hated ones exist.

A situation may arise when a page is both loved and hated at the same time. This can happen if the page was pinned by two different users and then was unpinned by the first one as a hated page and by the other as a loved page. In this case, assume that "love conquers hate", meaning that once a page is indicated as loved it should remain loved. 

What to Turn In

You are required to turn in your copy of all source files through the online secure site https://www.cs.ucr.edu, this includes all the files needed to make buftest and run the test program. The TAs should be able to go to your handin directory and type make and buftest, to run the program.

Please remember late submissions will not be accepted. Make sure to start early!



Top


sitemaster: Demetris Zeinalipour