Introduction to MongoDB - MongoDB Basics

Database Reference

In-Depth Information

The second key benefit to using BSON is that it is easy and quick to convert BSON

to a programming language's native data format. If the data were stored in pure JSON, a

relatively high-level conversion would need to take place. There are MongoDB drivers for

a large number of programming languages (such as Python, Ruby, PHP, C, C++, and C#),

and each works slightly differently. Using a simple binary format, native data structures

can be quickly built for each language, without requiring that you first process JSON. This

makes the code simpler and faster, both of which are in keeping with MongoDB's stated

goals.

BSON also provides some extensions to JSON. For example, it enables you to store

binary data and to incorporate a specific datatype. Thus, while BSON can store any JSON

document, a valid BSON document may not be valid JSON. This doesn't matter, because

each language has its own driver that converts data to and from BSON without needing to

use JSON as an intermediary language.

At the end of the day, BSON is not likely to be a big factor in how you use MongoDB.

Like all great tools, MongoDB will quietly sit in the background and do what it needs to

do. Apart from possibly using a graphical tool to look at your data, you will generally work

in your native language and let the driver worry about persisting to MongoDB.

Supporting Dynamic Queries

MongoDB's support for dynamic queries means that you can run a query without

planning for it in advance. This is similar to being able to run SQL queries against an

RDBMS. You might wonder why this is listed as a feature; surely it is something that every

database supports—right?

Actually, no. For example, CouchDB (which is generally considered MongoDB's

biggest “competitor”) doesn't support dynamic queries. This is because CouchDB has

come up with a completely new (and admittedly exciting) way of thinking about data. A

traditional RDBMS has static data and dynamic queries. This means that the structure

of the data is fixed in advance—tables must be defined, and each row has to fit into that

structure. Because the database knows in advance how the data is structured, it can make

certain assumptions and optimizations that enable fast dynamic queries.

CouchDB has turned this on its head. As a document-oriented database, CouchDB is

schemaless, so the data is dynamic. However, the new idea here is that queries are static.

That is, you define them in advance, before you can use them.

This isn't as bad as it might sound, because many queries can be easily defined

in advance. For example, a system that lets you search for a topic will probably let you

search by ISBN. In CouchDB, you would create an index that builds a list of all the ISBNs

for all the documents. When you punch in an ISBN, the query is very fast because it

doesn't actually need to search for any data. Whenever new data is added to the system,

CouchDB will automatically update its index.

Technically, you can run a query against CouchDB without generating an index;

in that case, however, CouchDB will have to create the index itself before it can process

your query. This won't be a problem if you only have a hundred topics; however, it will

result in poor performance if you're filing hundreds of thousands of topics, because each

query will generate the index again (and again). For this reason, the CouchDB team does

not recommend dynamic queries—that is, queries that haven't been predefined—in

production.

MongoDB Basics

Search WWH ::

Custom Search

Home