Data Organizing Methods


Two standard methods of organising data for data processing applications are the file-oriented approach and the database-oriented approach. The file-oriented approach was the traditional method used in the early days of data processing. Today it has largely been replaced by the database-oriented approach. However, several applications that deal with small, simple data sets still use the file-oriented approach.

Hence, both methods are described below.


File-oriented approach:

This method organizes an application's data into one or more files, and the application program processes the data stored in these files to generate the desired output. For example, it is customary to use a master file of permanent (and usually the latest) data, and transaction files containing data of a temporary nature. For instance, in a payroll application, in addition to containing all permanent details of each employee (name, address, employee code, pay rate, income tax rate, etc.), the master payroll file also contains the current gross-pay-to-date and tax-paid-to-date totals. The transaction payroll file contains details of the hours worked this month, split into normal and overtime hours, and, if piecework is involved, the quantity of goods made. When the user runs the monthly cycle of the payroll application, the application uses both the master and transaction files to generate this month's payslips, and updates the master file to make it ready for the following month's processing.
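The monthly payroll cycle described above can be sketched in Python. This is only an illustration: the record layouts, the in-memory stand-ins for the two files, the sample employee, and the overtime multiplier of 1.5 are all assumptions, not part of any particular payroll system.

```python
from dataclasses import dataclass


@dataclass
class MasterRecord:
    """One record of the (hypothetical) master payroll file."""
    employee_code: str
    name: str
    pay_rate: float            # pay per normal hour
    tax_rate: float            # fraction of gross pay withheld as tax
    gross_to_date: float = 0.0
    tax_to_date: float = 0.0


@dataclass
class TransactionRecord:
    """One record of the (hypothetical) monthly transaction file."""
    employee_code: str
    normal_hours: float
    overtime_hours: float


OVERTIME_MULTIPLIER = 1.5  # assumed value, for illustration only


def run_monthly_cycle(master, transactions):
    """Produce payslips and update the master records in place."""
    payslips = []
    for txn in transactions:
        emp = master[txn.employee_code]
        paid_hours = txn.normal_hours + OVERTIME_MULTIPLIER * txn.overtime_hours
        gross = emp.pay_rate * paid_hours
        tax = gross * emp.tax_rate
        # Update the to-date totals so the master file is ready for next month.
        emp.gross_to_date += gross
        emp.tax_to_date += tax
        payslips.append({
            "employee_code": emp.employee_code,
            "gross": gross,
            "tax": tax,
            "net": gross - tax,
        })
    return payslips


# In a real application these would be read from files on disk.
master = {"E001": MasterRecord("E001", "A. Rao", pay_rate=10.0, tax_rate=0.25)}
transactions = [TransactionRecord("E001", normal_hours=160, overtime_hours=8)]
payslips = run_monthly_cycle(master, transactions)
```

After the run, `master` holds the updated to-date totals, while `transactions` can be discarded and replaced with next month's data.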



Database-oriented approach:

The file-oriented approach of organising data for data processing applications is simple, inexpensive, and usually easy to use. However, it suffers from the following limitations:


1. Limited query flexibility:

The file-oriented approach organises records in a file according to a key field. Hence, it can quickly process queries based on the key field value. However, when the key field is not relevant to the desired information, the application needs to search the entire file. For example, if the key field is employee code in an employee file, and a user needs to list the names of all employees having a certain educational background, the application has to search all records to produce this information. If the user needs this information frequently, he/she needs to create a new file structured on an educational background key, along with a new program to process this file.
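The difference between a key-based query and a non-key query can be sketched as follows. A Python dictionary stands in for a file keyed on employee code; the sample records and function names are hypothetical.

```python
# A dictionary keyed on employee code stands in for the keyed employee file.
employees_by_code = {
    "E001": {"name": "Asha", "education": "B.Sc."},
    "E002": {"name": "Ravi", "education": "M.B.A."},
    "E003": {"name": "Meena", "education": "B.Sc."},
}


def find_by_code(code):
    """Query on the key field: a direct lookup, no scan needed."""
    return employees_by_code.get(code)


def names_with_education(education):
    """Query on a non-key field: every record must be examined."""
    return [
        rec["name"]
        for rec in employees_by_code.values()
        if rec["education"] == education
    ]


def build_education_index():
    """The 'new file' keyed on educational background.

    It speeds up the second kind of query, but it must be built and
    maintained separately from the original file.
    """
    index = {}
    for code, rec in employees_by_code.items():
        index.setdefault(rec["education"], []).append(code)
    return index
```

Here `find_by_code("E002")` touches one record, whereas `names_with_education("B.Sc.")` has to visit all of them.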


2. Data redundancy:

In the file-oriented approach, the same data items are often present in many different files. For instance, in the example above of creating a new file structured on an educational background key, the new file will also contain several data items present in the first employee file. Similarly, let us assume that a bank uses the file-oriented approach for storing its customers' data, and that it maintains separate customer files for its various types of services, such as savings accounts, term deposits, and loan accounts. In this case, many data items for a bank customer (such as home address, age, occupation, and identification information) may be present in multiple files. Repetition of the same data items in more than one file is known as data redundancy. It increases the cost of data entry and data storage. It also leads to the data integrity problem described below.


3. Data integrity problem:

Data integrity refers to the consistency of data in all files. For consistency, the system must carry out any change to a data item in every file containing that data item. For instance, in our example of a bank maintaining separate customer files for each type of account, when a customer moves to a new address, the bank must update his/her address field in all customer files containing this customer's record. Integrity of data is necessary to avoid confusion. If an application changes a data item but does not update it in all files containing it, the reports that the application produces from different files may frequently disagree with one another.
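The address example can be sketched as below. Two dictionaries stand in for the bank's separate savings and loan customer files; the customer id, names, and addresses are made up for illustration, and `inconsistent_customers` is a hypothetical helper.

```python
# Two separate "files" that both store the same customer's address.
savings_file = {"C100": {"name": "S. Kumar", "address": "12 Old Street"}}
loans_file = {"C100": {"name": "S. Kumar", "address": "12 Old Street"}}


def inconsistent_customers(*files):
    """Return ids of customers whose address differs between files."""
    first_seen = {}
    mismatched = set()
    for customer_file in files:
        for customer_id, record in customer_file.items():
            if customer_id not in first_seen:
                first_seen[customer_id] = record["address"]
            elif first_seen[customer_id] != record["address"]:
                mismatched.add(customer_id)
    return sorted(mismatched)


before = inconsistent_customers(savings_file, loans_file)

# The customer moves, but only the savings file is updated.
savings_file["C100"]["address"] = "7 New Road"

after = inconsistent_customers(savings_file, loans_file)
```

Before the partial update the two files agree; afterwards a report drawn from the loans file would still show the old address.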

4. Lack of program/data independence:

In the file-oriented approach, an application program usually contains data format statements, which define the format of each data field precisely as the application needs it for processing. Because different applications create different files, this often results in different files storing the same data item in different data formats. Data dependence occurs when the data is tied to the application that uses it. Due to the data dependence problem, whenever there is a need to add, delete, or change data formats, we also need to change the application programs. The data dependence problem also causes incompatibility among data files from different applications, because of which we cannot link these data files if such a need arises.
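A small sketch of data dependence, assuming a fixed-width record layout that is hard-coded in the program. The field widths and the `parse_record_a` function are hypothetical; the point is only that a change in the file's format breaks a program that was not changed with it.

```python
def parse_record_a(line):
    """Program A's hard-coded layout:
    code (4 chars), name (10 chars), pay rate (6 chars)."""
    return {
        "code": line[0:4].strip(),
        "name": line[4:14].strip(),
        "pay_rate": float(line[14:20]),
    }


# A record written in the layout the program expects.
record_v1 = "E001" + "Asha".ljust(10) + "010.50"
parsed = parse_record_a(record_v1)

# The file format changes: the name field is widened to 20 characters.
record_v2 = "E001" + "Asha".ljust(20) + "010.50"

try:
    parse_record_a(record_v2)
    broke = False
except ValueError:
    # The program now reads padding where it expects the pay rate.
    broke = True
```

Every program that reads this file would need the same edit to its offsets, which is the maintenance cost the paragraph above describes.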


5. Limited data security flexibility: 

Normally, the file-oriented approach offers data security only at the file level. Hence, it can enforce data access restrictions on an entire file only, not on an individual record or field.


Dissatisfied with these limitations of the file-oriented approach, researchers began to look for a better method of organising data to consolidate activities. Their efforts resulted in the database-oriented approach for organising data. This approach integrates data from multiple related files in the form of a database having the following properties:

1. It provides greater query flexibility.

2. It reduces data redundancy.

3. It solves the data integrity problem.

4. It makes data independent of application programs, enabling users to add, change, and delete fields in the database without affecting existing programs.

5. It also includes data security features at the database level, record level, and even field level, to provide greater flexibility in restricting data access.
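Some of these properties can be illustrated with a minimal sketch using Python's built-in sqlite3 module and an in-memory database. The table names, columns, and sample rows are assumptions chosen to mirror the bank example; the sketch covers reduced redundancy, integrity, and query flexibility only, and does not demonstrate the security features.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Customer details are stored once, not once per account type.
    CREATE TABLE customers (
        customer_id TEXT PRIMARY KEY,
        name        TEXT NOT NULL,
        address     TEXT NOT NULL,
        occupation  TEXT
    );
    CREATE TABLE accounts (
        account_id  TEXT PRIMARY KEY,
        customer_id TEXT NOT NULL REFERENCES customers(customer_id),
        kind        TEXT NOT NULL
    );
""")
conn.execute(
    "INSERT INTO customers VALUES (?, ?, ?, ?)",
    ("C100", "S. Kumar", "12 Old Street", "Teacher"),
)
conn.executemany(
    "INSERT INTO accounts VALUES (?, ?, ?)",
    [("A1", "C100", "savings"), ("A2", "C100", "loan")],
)

# Integrity: one update changes the address seen by every account.
conn.execute(
    "UPDATE customers SET address = ? WHERE customer_id = ?",
    ("7 New Road", "C100"),
)
rows = conn.execute("""
    SELECT a.kind, c.address
    FROM accounts a JOIN customers c ON c.customer_id = a.customer_id
    ORDER BY a.kind
""").fetchall()

# Query flexibility: search on a non-key field without a new file or program.
teachers = conn.execute(
    "SELECT name FROM customers WHERE occupation = ?", ("Teacher",)
).fetchall()
```

Because the address lives in a single row of `customers`, both the savings and the loan account report the new address after one update.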


