What is data?
Information = Data with interpretation
Weather measurements: Temperature in Celsius scale --> We know that temperatures of this week are raising
Knowledge = Information with meaning in the context
"It is quite hot in Finland in the end of August"
”Don’t wear long trousers today”
Simple definition: Set of interconnected data
Long definition: Shared collection of logically related data (and a description of this data), designed to meet the information needs of an organization.
Shared: Several applications can use.
Separated from the individual application (its own layer)
Example of a database
Database for student administration
A collection of data is needed about
Student's basic information, enrollments, credits
Courses
Classrooms, premises
Teachers …
There can be several views to the same information.
The teacher can see the grades of all students and can enter them
The student can see his own information and performance **Characteristics of a database**
Database needs to…
Support multiple users
Transactions
Provide different views to same data
Three-Schema Architecture: Internal, conceptual and external level
Internal: describes physical storage structure
Conceptual: e.g. ER-model
External: part of the data is available to a particular user group and the rest is hidden (e.g. Views)
Be separated from the applications that use its data Problems of using files for storing data
Each application has its own data
All necessary data may not be available to other applications
Multiplication of information
The same data in several places
For example, customer data
Are you sure they are up to date?
Dependencies
File structure
Incompatible file formats
Language dependency
Standardized surveys and contents.
Making changes is tedious for both to the application and to the information content **When to use a database instead of files for storing data?**
If the data is local and static
Saving data to a file could work OK
Example: several software have a config.xml or a similar file.
If you have any information that is
Needed elsewhere than in one application (reading and/or updating)
It will be changed (often) and
It must remain reliable and consistent
There's a lot of data to be stored
You need to search and query data under (changing) conditions

Introduction to database design aims, phases and ER modelling