cs 474 database design and application terminology jan 11, 2000
TRANSCRIPT
Database Database - Collection of related data with
implicit meaning.• Your cards are a database.• The phone book is a database.• Your filing systems are databases.
Data Data - Known facts that can be recorded
and have meaning.• Your cards are the database and what you have
recorded (written) on them is the data.• The entries in the phone book are data.• The information written in your file system is
data.
Databases, cont Collection of related records with specific
meanings• Random assortment of data is not a database.• Designed for a specific purpose. • Have a source for the data.• Have a specific audience who will use the data.• Represents some aspect of the real world.
– models real world entities and real world processes.
Database Management System Abbreviated DBMS. Collection of programs to enable users to
build and maintain databases. General purpose software, like a
programming language.
Database Management Systems, cont
Your book talks about a database system as being a “computerized record-keeping system”. Databases do not need to be on computers, but they do keep records.
Our focus in class will be (of course) the computerized database system.
What makes up a database? Database - central file store. File - a collection of individual records. Record - is a collection of data elements
about one entity. Data element - a single piece of data with
properties.
Operations on Databases Creation and removal of files Insertion and deletion of individual records
within files. Changing data within existing files. Retrieving data from existing files.
Database system components Data Hardware on which it resides Software that manipulates the data Users that maintain or use the data
Data In a database, data are organized in such a
way that we can make efficient use of the data. (Files in a cabinet are organized to make insertion, deletion, and retrieval easier).
The book talks about integretation of data and sharing.
Integration Separate diverse data (files) are organized
in such a way that we can make use of disparate data across the system. For example, if I am in the Admissions office and I want to know which of my recruits from last year are still in school, I would access the Registration information. Integration leads to efficiency of operations.
Integration Also means that we can reduce the
redundancy of the data. If both the Admissions office and the Registrar’s office need names of students, we can store the names in one file and both offices can use those names with their individual files.
Redundancy This property allows us to control the
redundancy in the system, by allowing us to put common data in one place which is accessible from everywhere in the system.
Redundancy - duplication of data across systems.
Why is redundancy bad? Loss of efficiency - as each office is collecting
data, each person is spending time gathering data that the organization already has in hand.
Inconsistency - a change in data in one place is not necessarily propogated to all offices.
Storage is wasted - extra space to house the duplicate information.
Shared data By integrating the database, you permit
your users to share data across offices. That enables queries, research and
processes to use the entire collection of data.
Is all data shared? Typically, no. Each user will have his/her own view of the
data. That view will be customized to their needs.
Some data should not be viewed by all users. Protections (security) prevent unauthorized access to data.
Hardware Hardware provides the “file cabinet” for the
database. It provides a place on which the data resides.
Hardware also provides us with some tools to maintain the data (keyboard, tapes, modem, memory).
Software - DBMS The Database Management System
(DBMS) provides a layer between the stored data and the users.
All requests for manipulation of data go through the DBMS.
Services typically provided Design the database. Includes ability to
build new files, remove files, add elements to files, establish the “connections” between files.
Manipulate the database. Includes features that enable users to directly access the data housed in the database. Provides change, retrieval, insertion, and deletion functions.
Services provided, cont. DBMS typically provide a way to protect
the data from unauthorized use. Identification of users and password protection of the database level.
DBMS typically provide a way to customize users views of the database (which is also a protection).
Examples of DBMS systems MS Access Oracle MS SQL Server Dbase
Each product from each vendor will have some of its own unique features.
Users Users provide the fourth component of the
system. Users interpret the data found in the dbms. Users maintain the database and data within
the data base.
Application programmers Make use of the database in their programs. They may issue queries, provide screens,
provide reports, provide other processing to the users of the system.
Build “user” interfaces into the application. Database provides “data independence” with
their applications. (Database can change, while applications remain constant).
End Users Typically use the database through some
user interface functions provided by the application programmers.
May use functions provided directly by the database management system, such as queries.
Database Administrator Maintains the database itself. Usually maintains the security within the
system. May not decide what the security schema is. May not decide what is needed in the
database.
Databases, cont Book talks about “persistent” data.
Typically a database is developed to retain data over time.
Some of that data will have a longer life than other data. Some data will “persist” for only a few moments to support some other type of processing.
Data as an Organizational Asset We spend time on databases since data is
one of the most valuable assets within an organization.
We spend time collecting it; we spend time retrieving it.
A good database design facilitates the use of that organizational asset. A bad database design will devalue the asset.
Advantages of a database Data independence - separate the
application from the details of the underlying data. Applications interact with the DBMS not with the actual data.
Database applications provide transaction support. A transaction (or logical unit of work) either goes all the way through or goes none of the way. It cannot be divided.
Entities and Relationships An entity is any object about which we want to
gather information.• Students
• Courses
• Grades
• Professors
A relationship describes how those entities are connected to one another. (A relationship might be an entity in its own right.)
E/R Diagram Provides the “picture” of the database using
entities and the relationships among those entities.
Take a look at page 11 of the book. We will spend more time on this shortly.
Entities, cont Entities have properties which describe that entity. Your cards have listed some properties of the
students that you interviewed. Each type of entity will have its own unique
properties and often some properties that it shares with other entities.• Ex. Cars and trucks might be two entities, but they
share many properties such as having wheels.
What is a relational database Any database that is based on the formal
theory called “The relational model of data.” Consists of “tables” which are equivalent to
files. Consists of “rows” and “columns”. Consists of formal operations to work with
the tables.
What are some other types? Hierarchical database system - Data is represented
as a tree structure with pointers from one entity to another.
Netword database system - Data is represented as a network of entities.
The database system is categorized by its data structure.
We will only be dealing with relational databases.