Data types: A data type is
classification of various types of data Examples of data. Data types are
integer, float, character, string (A group of characters)
Integer: An integer data type can hold
a whole number.
Float: Float values are also called
real data valves are stored in a single memory location.
Logical data type: It consists of 2
values is stored by true/false.
Characters: Character data type when we
went to include letters, numbers, spaces, symbols and punctuations, the size is
1 byte. A few data types are corrective field, data fields, integer fields and
text fields.
Data processing: Data (15) a collection
of facts, un organized to be organized into useful information. We give data as
an input and we get the output can be used to help people make decisions it is
called information data processing system is used to include the resources such
as people, procedures and devices that are used to accomplish the processing of
data for producing desirable output.
Data storage
hierarchy:
Field: A field is a meaning full
collection of related characters.
Ex: S.No,
S.name….. etc.
Record: Fields we normally grouped
together to form a record.
Ex: Field
Character
![]() |
Field
or Data
![]() |
Record
File
File organization:
System designers choose to organize, access and process records and files in
different ways depending on the type of application and the needs of users.
There are 3 commonly used file organizations.
1. Sequential
2. Direct
3. Indexed
sequential organization
The selection of a particular organization depends upon the
type of application. File organization requires the use of some key field,
identifying value i.e. found in every record in the file.
Serial file
organization: Records are arranged one after another in no particular order
serial organisation is commonly found with transaction data where records are
created in a file in the order in which transactions takes place.
For example: Daily
purchase.
Sequential file
organization: Records are stored one after another in an ascending (or)
descending order determine by the key field of the record.
Ex: Payroll
To access a record, the computer must read the file in
sequence from the beginning.
Advantages:
1.
Easy to understand, maintain and organize.
2.
There is no overhead in address generation.
3.
It is more efficient and economical file organisation
Disadvantages:
1.
Active ratio is very low.
2.
Transactions must be stored and placed in sequence
prior to processing.
3.
Data redundancy (duplication of data) is typically
high.
Direct access file
organization: Allows immediate access to individual record on the file, the
records are stored and retrieved using a relative record number, which gives
the position of the record in the file. The most widely used direct access
methods are
† Self Direct addressing: Key is used as
its relative address. Therefore we can complete the records address
directly from the record key and the physical address of the first record in
the file. It is a fixd length
record in a sequential file and in which the keys are from a complete range of
consecutive records.
Key Record
F1 1-80
F2 81-130
Advantages: No
need to store an index
Disadvantages:
1. Records must be of fixed length.
2. If some records are deleted their storage space remains
empty.
Indexed sequential
file organisation (ISAM):
It is an hybrid between sequential and direct access. The
records with in the file are stored sequentially but direct access to individual
record possible.
This type of file organize is suitable for batch processing
and online processing
Advantages:
1.
Efficient and economical use.
2.
Activity ratio is very high.
3.
Permits direct access processing of records in a
relatively efficient way when the activity ration is low.
Disadvantages:
1.
Access to records may be slower than direct files.
2.
Less efficient in the use of storage space.
† Random access method: Transactions can
be processed in any order and written at any location to store
file. The desired records can be directly accessed using randomizing procedure.
Advantages:
1.
Retrieval of record is quick and direct.
2.
Need not be stored in sequence.
3. It is suitable interactive online
applications like banking, air line and railway reservation system.
Disadvantages:
1.
Address generation over head is involved for accessing
each record due to hashing function.
2.
Less efficient than sequential file organisation.
† Best file organisation: The following
factors are to be considered for the best file organisation.
1.
File volatility
2.
File activity
3.
File interrogation
4.
File size.
File volatility: No.
of additions and deletions to the file in given period of time.
File activity: It
is the proportion of master file records that are actually used (or) accessed
in a given processing run. In real time file, where each transaction is
processed immediately at a time, only one master file record is accessed (ATM
transaction). If requires a direct access method.
File interrogation: File
interrogation means retrieval of individual records must be fast to support a
real time operation such as airline reservation then some kind of direct
organisation is required.
File size: Immediate
response must be organized by direct access method in large file system. On the
other hand with small files it may be more efficient to search the entire file
sequentially.
Database Management
System (DBMS):
A database management system is a set of software
programmes, that controls a organisation storage, management and retrieval of
data in a data base. A data base is repository for related collection of data
for example an address book can be a data base where names, address, phone No.
of friends on business contacts are stored.
Name address Friend
Ph.No. Business
contact
Management problems of
file processing:
Data needed for each user application was stored in
independent data files processing consisted of using separate computer
programme that updated these independent data files and used them to produce
documents and reports by separate user application. There are several problems
in file processing approach.
1. Data duplication
2. Independent data files include a lot of duplicated data,
the same data is recorded and stored in several files.
3. This data integration: Independent files makes it
difficult to provide end users with the information for adhoc requests that
requires accessing data stored in several different files. It is time consuming
of expensive for the organisation.
Data dependence:
In file processing system, major components of a system are
1. Organisation
of files
2. Their
physical location on storage
3. Hardware
4. Application
software
The above components are used to access these files depend
on one another in significant ways.
Data integrity &
Security: There are certain integrity constraints defined in DBMS to
protect an unauthorized access to the data in the databases. For example, when
inserting the data for a particular field, say salary for an employee data
base. It cannot be null. It does not allow the user to leave the field blank,
thus providing integrity and security on the data base.
Solutions for
Database Management:
File processing system.
1.
Reduce data redundancy and inconsistency
2.
Enhance data integrity and security
3.
Provide logical and physical data independence
4.
Improved data sharing
5.
Low cost of developing and maintaining system
What is a data Base?
A data base is a computer file that uses a particular file
organisation to facilitate rapid updating of individual records, simultaneous
updating of related records, easy access to all records by all processed data.
Explain the
architecture of DBMS:-
The following are the 3 levels architecture
1.
External (or) user view
2.
Conceptual (or) global view
3.
Internal (or) Physical view
External (or) user
view:
1.
It is the highest level of data abstraction
2.
It includes only those portions of data base which is
concerned to the user
3.
It describes by means of external scheme
Global or Conceptual
View:
1.
All data base entitles and relationships among them are
included.
2.
Single view represents the entire data base.
3.
It describes all records, relationships and
constraints.
Internal (or)
physical view:
1.
It is the lowest level of data base abstraction
2.
It indicates how data will be stored.
3.
It describes data structure and access method.
4.
It is closest to the physical storage method.
Schema: Moving
unordinary file management to data base system is to separate all data
definitions from the applications, programmes and to consolidate them into a
separate entity called schema, the indication of the logical relationship
between various components of the data base, there are 3 types of schemas
1.
Physical schema (lowest level)
2.
Logical schema (intermediate level)
3.
Sub-schema (highest level)
Data independence: It
is an ability of a database to modify a table definition at one level without
affecting the next higher level, is called data Independence. It means that
when the table is changed at one level and the next level remains unchanged.
There are 2 types of data independence.
1.
Physical data Independence
2.
Logical data Independence
Physical data
Independence:
There is no need to change the conceptual schema, changes in
internal schema is needed by upgrading storage structure and to improve the
performance of the system.
Logical data
independence:
There is no need to change in the external scheme we can
change in the conceptual schema by adding, deleting, updating the records in
the data base.
Parts of database
management system:
The data base management system base major parts
1.
Data
2.
Hardware
3.
Software
4.
User
Data: Most
organizations generate, store and process a large amount of data. The data acts
as a bridge between hardware, software and users. We access it directly through
some application programmes.
Hardware: It
consists of secondary storage device such as magnetic disk, Magnetic tape, CD,
DVD for storing and retrieving the data in a fast and efficient manner.
Software: DBMS
acts as a bridge between user and the data base, we use SQL or application
software’s to do the operations on the data base like insertion, deletion and
up-dation.
User: The broad
classes of users are
a)
Application programmers and system analysts.
b)
End-user.
c)
Data base administrator.
d)
Data base designer.
System analyst: Determines the requirements of End user.
Application
programmers: The programmers implement these specifications as programmes
and the test debug document and maintain the transactions.
End users: There
are the people who require access to the data base for updating and generating
reports.
Database
Administrator: Is responsible for authorization access, coordination,
monitoring and for acquiring the needed software and hardware resources to the
database.
Database designers:
These are responsible for choosing appropriate structures to represent and
store this data.
Relationships in
database: It is a complex task to identify relationship among records in a
large database, the various types of relationships are
i.
One-to-one relationship as in a single parent record to
a single child record or husband record to wife.
ii.
One-to-many relationship single parent records to a
single child record.
iii.
Many-to-many relationships: Two or more parents records
to two or more child records.
Data base structures:
Database management systems are designed to use 3 database structures to
provide easy access to information stored in databases and their relationships
logically.
The 3 database structures are
i.
Hierarchical database structure
ii.
Network database structure
iii.
Relational database structure
Hierarchical database
structure: Records are logically organized into a hierarchical of
relationship. A hierarchically structured database is arranged logically in an
inverted tree pattern. All records in hierarchial is called as “NODES”.
Hierarchical data structure implements one-to-one and
one-to-many relationship.
Features of
Hierarchical database:
i.
Data bases are less flexible than other database
structures. Declaration between records are relatively fixed by the structure,
Relation between records are relatively fixed by the structure.
ii.
Manager use of query language to solve the problem may
require multiple searches and prove to very time consuming.
Network Data Base
structure:- It builds on the concept of multiple branches, lower level and
higher level structures. A network data base structure views all records in
sets. An owner record and one or more member records.
Relational database
Models: In Relationship data base model, the hierarchical and network
database structures require exploit relationships between records in the data
base. The key terms are used in relational data base model are relations.
Attributes and domains. A relation is
a table with columns and rows.
Key: Key often
referred to the specific structure and components of link list, chain of
pointers (or) a key is a set off one (or) more columns whose combined values
are unique among all occurrences in a given table. There are various types of
relational keys.
i.
Primary key:
A primary key is used to get an unique value.
ii.
Candidate key:
One or more columns whose combined values are unique among all occurrences
iii.
Alternate key:
Any table is simply those candidate keys which are not currently selected as a
primary key.
Note: An alternate key is a function of
all candidate key – primary key.
iv.
Secondary key:
These keys are used to optimize the data access.
INFORMATION TECHNOLOGY
Explain Data Base
Models:-
1.
Distributed data
base:- Sometimes an organisation may require decentralizing its data base
by scattering it with computing resources to several locations, so that the
processing is done at more than sight.
2.
Replicated data
base:- Duplicates of data are provided to the sights, so that we can access
same data concurrently. Replication data base is costly in terms of system
resources and also maintaining the consistency of the data elements.
3.
Partitioned data
base:- Database is divided into parts that are required and appropriate for
the respective sight, only those sights are partitioned or distributed without
replication of entire data.
4.
Entity
relationship model data base:- ER model is a specialized graphic that
illustrates relationship between entities in the data base. It is used to
produce a type of conceptual scheme of a system asset of all entities of same
time is called “entity set”. The degree of relationship indicates the link
between two entities for specific occurrence of each. The degree of
relationship is called “cardinality”. The basics of “ER” diagram are
Define object
oriented data base:- Objects are entities converging some meaning for us
and possess such retributes to characterize them and interacting with each
other. Object oriented data base provides mechanism to store complex such as
images audio and video.
Any object in the real world consists of member data and
member functions. So each object is an independently functioning application.
Examples of object
oriented language:-Pipthonog, C++, JAVA, C#, Small talk etc.
Client-server data
base:-In client server data base where client requests the server and
server response according to the client request. Client machine contains user
interface, server machine contains data base, both are coupled with a network
of highbandwidth.
Client 1
Client 2 

Client-Server Data Base
Server
The above diagram is an example of two tier architecture.
Knowledge Data Base:-
A knowledge Data Base System provides system provides functions to define, create, modify, delete and
read data in a system. The time data maintained in a data base system
historically has been declarative data description the static aspects of the
real world objects and their associations. In this knowledge data base, dynamic
aspects of the real world objects are stored; so that it can be used for
decision support system and executive information system it contains integrated
data. Detailed data, summarized data, historical data and meta data is called a
“DATA WAREHOUSE” The process of recognizing the required data among data ware
house “DATA MINING”
Explain the
components of Data Base:- There are 2 components used in data base
1.
Data definition Language (DDL)
2.
Data Manipulation Language (DML)
Data Definition
Language:- It defines the conceptual level which means it is in between
“user level” and “physical level”
Functions of data
definition language:-
1.
It describes schema and sub-schema
2.
They indicates the keys of the record
3.
Provide data security
4.
Provide logical and physical data independence
DATA MANIPULATION
LANGUAGE:- They provide data manipulation techniques like deletion,
modification, insertion, replacement, retrieval, sorting and display of data.
They facilitates use of relationship between records they provide for
independence of programming languages by supporting several high level
procedural languages like PL/SQL,COBAL, C++ … etc
STRUCTURE OF DBMS:-
DDL COMPILER:-
1.
It converts data definition statements into set of
tables.
2.
Tables contain meta-data
DATA MANAGER:-
1.
It is a central software component
2.
It refers to by data base control system
3.
It converts operations in user queries to physical file
system
FILE MANAGER:-
1.
Responsible for file structure
2.
Responsible for managing space
3.
Responsible for requesting block from disk manager
4.
Responsible for transmitting required records to data
manager.
DISK MANAGER:-
1.
It is a part of operating system it carries out all
physical input, output operations.
PUENY MANAGER:-
2.
It interprets users query.
3.
It converts to an efficient series of operations
4.
It uses information to modify query
5.
It uses data dictionary
6.
It prepares an optimal plan to access data base for
efficient data retrieval
DATA DICTIONARY:- It
maintains information of meta-data and data
DATA BASE
ADMINISTRATOR (DBA):-
Data Base Administration (DBA) is a person responsible for
the design, implementation, maintenance and repair of an organizations data
base. They are also known as coordinator (or) “data base programmer”. The
following are the roles and responsibilities of data base administrator.
1.
DBA has an overall authority to establish and control
data definitions and standards.
2.
DBA provides data base security system to restrict
un-authorised users.
3.
DBA also trains and assists application programmes in
the use of data base.
4.
Data base administrator must have a discussion with
users and then he decides the schedule and accuracy requirements, the way and
frequency data access, search strategies, physical storage response time, level
of security etc.
5.
DBA is responsible for originating and updating of
data.
6.
DBA is responsible for controlling access to data base and two other
important functions that handled by the DBA using DDL.
7.
DBA prepares “documentation” which includes procedures,
guidelines, data description and data base environment.
8.
DBA is responsible for taking back ops and recovery
procedures.
9.
DBA also monitors the data base environment.
10. DBA
also setup procedures for identify and correcting violation of standards,
documents and correct errors. Finally DBA is responsible for creating new
utility programmes (or) new system releases.
TYPES OF DATA BASES:-
The growth of distributed processing, end user computing,
decision support and executive info-systems has caused the development of
several types of data bases.
OPERATIONAL DATA
BASE:- These data bases store detailed data needed to support support the
operation of the entire organisation. Ex: customer data base, personnel data
base, inventory database etc.
MANAGEMENT DATA BASE:-
These data bases store data and info extracted from selected operational and
external data base. It consists of summarized data and information to support
managerial decision making.
INFORMATION WARE DATA
BASE:- In his data base, current and previous years of data is stored it is
an external source of data that has been standardized and integrated so that it
can be used by managers and end users.
EXTERNAL DATA BASE:-
Privately owned online data bases. Ex: databanks, newspapers magazines and
other periodicals from bibliographic data base.
TEXT DATA BASE:-
IT has stored data documents electronically, they use text data base management
system software to help, create, store, search retrieve, modify and assemble
documents and other information stored as text data in such data bases.
IMAGE DATA BASE:-
Wide variety of images are stored electronically. Ex: photographs, animated
videos etc.,
STRUCTURED QUERY
LANGUAGE:- A query language is a set of commands to create, update and
access data from a data base allowing user to raise queries with the help of
the programmers.
Ex: My SQL server, oracle data base.
SQL usually consists of 3 parts
Ø
DML-DATA MANIPULATION LANGUAGE
Ø
DDL-DATA DEFINITION LANGUAGE
Ø
DCL-DATA CONTROL LANGUAGE
DML – It consists
of select, update, delete, insert statements
DDL – CREATE
& ALTER
DCL – GRANT &
REVOKE
BACK UPS:- This
is an utility programme used to copy the entire content of the data base
(duplication of data) it consists of root file log file mirror log file and
other data base files called db data base spaces.
RECOVERY:-
Recovery is the sequence of tasks performed to restore a data base to some
point in kind recovery is performed when hardware (or) media failure occurs.
Backup is a good practice of recovery.
ONLINE BACK UP:-
Online backup process being the data base engine externalizes all cached data
pages kept in memory to the data base files on disk the process is called a
check point.
TRANSACTION LOG:-
DATA BASE engine continues recording activity in the transaction log file while
the data base is being back up.
LIVE BACK UP:-
This provides a redundant copy of the transaction log for restart your system
on a secondary machine in the event the primary data base becomes unusual.
OFFLINE BACK UP:
Back up is being when the data base is shutdown through a normal shutdown
process. The database engine commits the data in the data base files.
STRATEGY FOR
DEVELOPING A BACKUP & RECOVERY:-
Ø
Understand the backup & recovery means to
your business
Ø
Commits time & source for the project
Ø
Beware of any external factor that affect the
recovery
Ø
Develop test, time, health check, deploy &
monitor
DATA WARE HOUSE:-
A data ware house is a repository of an organizations
electronically stored data ware houses are designed to facilitate reporting and
supporting data analysis.
ADVANTAGES:
i.
Reduces the response time
ii.
Optimises the reporting and analysis the information
iii.
Reports in operation system often required written
specific computer programmes which was slow and expensive (disadvantage)
separate data bases being to support management information and analysis
process this we use different tools such as spread sheet so that the
requirement of managers auditors and accountants will get very easy and fast
data to analyse. Data ware houses have evolved through several fundamental
stages.
OFFLINE OPERATIONAL
DATA BASE:- In this initial stage are developed by simply copying the data
base of an operational system to an offline server where the processing load of
reporting does not impact on the operational system performance.
OFFLINE DATAWARE
HOUSE:- In this stage of evolution are updated on a regular time cycle
(usually, daily weekly or monthly)
REAL TIME DATA WARE
HOUSE:- At this stage updation takes place in transaction base.
Ex:- Delivery of goods and ticket booking
INTEGRATED DATA WARE
HOUSE:- At this stage transactions that are passed back into the operation
system for use in daily activity of the organisation.
COMPONENTS OF DATA
BASE HOUSE:- The primary components of the data ware house are
i.
DATA SOURCES:- Where
data is stored electronically where management can use the data for analysis
(or) analytics for server IBM DB2 etc for client side oracle DB-Informix etc.
ii.
DATA TRANSFORM:-
In this layer receives data from data sources cleans and standardize the data.
This is often called “staging”.
iii.
META DATA:-
In few data ware houses these are optional components such as dependent data
marts, logical data marts, operational data store.
ADVANTAGES OF DATA
WARE HOUSE:-
i.
It increases the ability of access to reports and
analysis of information.
ii.
Increases data consistency.
iii.
Able to combine data from different sources.
DISADVANTAGES:-
i.
Extracting cleaning and loading data could be time
consulting
ii.
We can get out dated information quickly
iii.
Providing training to end users
iv.
Data warehouse is usually a non static and maintenance
costs are high.
MINING: Mining is
concerned with the analysis of data and picking out relevant information it is
computer which is responsible for finding the patterns by identifying the
underlying roles and features in the data.
Examples of data mining software’s are SPSS, SAS, think
analysis and G-stat these are the following stages where we can interpret and
evaluation takes place.
i.
Selection: We have to take the data
according to some criteria.
ii.
Pre-processing: At this stage
certain information is removed which is deemed unnecessary and may slow down
queries.
iii.
Transformation:- The data is
transformed into different software tools like demographic so that it is easily
navigable.
iv.
Interpretation & evaluation:-
These patterns are identified by the system and transpose into knowledge which
is used to support human decision making.
No comments:
Post a Comment