Databases Reference
In-Depth Information
●
A distributed database is a single logical database that is physically divided among computers at several sites
on a network. A user at any site can access data at any other site. A DDBMS is a DBMS capable of support-
ing and manipulating distributed databases.
●
Computers in a network communicate through messages. Minimizing the number of messages is important
for rapid access to distributed databases.
●
A homogenous DDBMS is one that has the same local DBMS at each site, whereas a heterogeneous DDBMS
is one that does not.
●
Location transparency, replication transparency, and fragmentation transparency are important characteris-
tics of DDBMSs.
●
DDBMSs permit local control of data, increased database capacity, improved system availability, and added
efficiency.
●
DDBMSs are more complicated than non-DDBMSs in the areas of updating replicated data, processing que-
ries, treating concurrent update, providing measures for recovery, managing the data dictionary, designing
databases, and managing security and backup requirements.
●
The two-phase commit usually uses a coordinator to manage concurrent update.
●
C. J. Date presented 12 rules that serve as a benchmark against which you can measure DDBMSs. These
rules are local autonomy, no reliance on a central site, continuous operation, location transparency, fragmen-
tation transparency, replication transparency, distributed query processing, distributed transaction manage-
ment, hardware independence, operating system independence, network independence, and DBMS
independence.
●
A file server stores the files required by users and sends entire files to the users.
●
In a two-tier client/server architecture, a DBMS runs on a file server and the server sends only the requested
data to the clients. The server performs database functions, and the clients perform presentation functions.
A fat client can perform the business functions, or the server can perform the business functions in a thin cli-
ent arrangement.
●
In a three-tier client/server architecture, the clients perform the presentation functions, database servers per-
form the database functions, and application servers perform business functions. A three-tier architecture is
more scalable than a two-tier architecture.
●
The advantages of client/server systems are lower network traffic; improved processing distribution; thinner
clients; greater processing transparency; increased network, hardware, and software transparency; improved
security; decreased costs; and increased scalability.
●
Web servers interact with Web clients using HTTP and TCP/IP to display HTML Web pages on Web clients'
screens.
●
Dynamic Web pages, not static Web pages, are used in e-commerce; and server-side and client-side exten-
sions provide the dynamic capability, including the capability to interact with databases.
●
Cookies and session management techniques are used to counteract the stateless nature of HTTP.
●
XML was developed in response to the need for data exchange between organizations and due to the inabil-
ity of HTML to specify the structure and meaning of its data. XML is a metalanguage designed for the
exchange of data on the Web.
●
The W3C has developed recommendations for other languages related to XML. These languages include
XHTML, a markup language based on XML and a stricter version of HTML; DTD and XML schema, both used
to specify the structure and meaning of data in an XML document; XSL, a language for creating stylesheets;
XSLT, which transforms an XML document into another document; and XQuery, which is an XML query
language.
●
OLTP is used with relational database management systems, and OLAP is used with data warehouses.
●
A data warehouse is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of
management's decision-making process.
●
A typical data warehouse data structure is a star schema consisting of a central fact table surrounded by
dimension tables.
307