web2py comes with a Database Abstraction Layer (DAL), an API that maps Python objects into database objects such as queries, tables, and records. The DAL dynamically generates the SQL in real time using the specified dialect for the database back end, so that you do not have to write SQL code or learn different SQL dialects (the term SQL is used generically), and the application will be portable among different types of databases. At the time of this writing, the supported databases are SQLite (which comes with Python and thus web2py), PostgreSQL, MySQL, Oracle, MSSQL, FireBird, DB2, Informix, Ingres, the Google App Engine (SQL and NoSQL) and MongoDB. Experimentally we support more databases. Please check on the web2py web site and mailing list for more recent adapters. Google NoSQL is treated as a particular case in Chapter 13.
The Windows binary distribution works out of the box with SQLite and MySQL. The Mac binary distribution works out of the box with SQLite.
To use any other database back-end, run from the source distribution and install the appropriate driver for the required back end.
``database drivers``:inxx
Once the proper driver is installed, start web2py from source, and it will find the driver. Here is a list of drivers:
``DAL``:inxx ``SQLite``:inxx ``MySQL``:inxx ``PostgreSQL``:inxx ``Oracle``:inxx ``MSSQL``:inxx ``FireBird``:inxx ``DB2``:inxx ``Informix``:inxx ``Sybase``:inxx ``Teradata``:inxx ``MongoDB``:inxx ``CouchDB``:inxx ``SAPDB``:inxx ``Cubrid``:inxx
----------
database | drivers (source)
SQLite | sqlite3 or pysqlite2 or zxJDBC ``zxjdbc``:cite (on Jython)
PostgreSQL | psycopg2 ``psycopg2``:cite or pg8000 ``pg8000``:cite or zxJDBC ``zxjdbc``:cite (on Jython)
MySQL | pymysql ``pymysql``:cite or MySQLdb ``mysqldb``:cite
Oracle | cx_Oracle ``cxoracle``:cite
MSSQL | pyodbc ``pyodbc``:cite
FireBird | kinterbasdb ``kinterbasdb``:cite or fdb or pyodbc
DB2 | pyodbc ``pyodbc``:cite
Informix | informixdb ``informixdb``:cite
Ingres | ingresdbi ``ingresdbi``:cite
Cubrid | cubriddb ``cubriddb``:cite ``cubrid``:cite
Sybase | Sybase ``Sybase``:cite
Teradata | pyodbc ``Teradata``:cite
SAPDB | sapdb ``SAPDB``:cite
MongoDB | pymongo ``pymongo``:cite
IMAP | imaplib ``IMAP``:cite
---------
``sqlite3``, ``pymysql``, ``pg8000``, and ``imaplib`` ship with web2py. Support for MongoDB is experimental. The IMAP option allows you to use the DAL to access IMAP.
web2py defines the following classes that make up the DAL:
**DAL** represents a database connection. For example:
``sqlite``:inxx
``
db = DAL('sqlite://storage.db')
``:code
``define_table``:inxx
**Table** represents a database table. You do not directly instantiate Table; instead, ``DAL.define_table`` instantiates it.
``
db.define_table('mytable', Field('myfield'))
``:code
The most important methods of a Table are ``insert``, ``truncate``, ``drop``, and ``import_from_csv_file``.
``insert``:inxx
``truncate``:inxx
``drop``:inxx
``import_from_csv_file``:inxx
**Expression** is something like an ``orderby`` or ``groupby`` expression. The Field class is derived from the Expression. Here is an example:
``
myorder = db.mytable.myfield.upper() | db.mytable.id
db().select(db.table.ALL, orderby=myorder)
``:code
### Connection strings
``connection strings``:inxx
A connection with the database is established by creating an instance of the DAL object:
``
>>> db = DAL('sqlite://storage.db', pool_size=0)
``:code
``db`` is not a keyword; it is a local variable that stores the connection object ``DAL``. You are free to give it a different name. The constructor of ``DAL`` requires a single argument, the connection string. The connection string is the only web2py code that depends on a specific back-end database. Here are examples of connection strings for specific types of supported back-end databases (in all cases, we assume the database is running from localhost on its default port and is named "test"):
-------------
**SQLite** | ``sqlite://storage.db``
**MySQL** | ``mysql://username:password@localhost/test``
**PostgreSQL** | ``postgres://username:password@localhost/test``
**MSSQL** | ``mssql://username:password@localhost/test``
**FireBird** | ``firebird://username:password@localhost/test``
**Oracle** | ``oracle://username/password@test``
**DB2** | ``db2://username:password@test``
**Ingres** | ``ingres://username:password@localhost/test``
**Sybase** | ``sybase://username:password@localhost/test``
**Informix** | ``informix://username:password@test``
**Teradata** | ``teradata://DSN=dsn;UID=user;PWD=pass;DATABASE=database``
**Cubrid** | ``cubrid://username:password@localhost/test``
**SAPDB** | ``sapdb://username:password@localhost/test``
**IMAP** | ``imap://user:password@server:port``
**MongoDB** | ``mongodb://username:password@localhost/test``
**Google App Engine/SQL** | ``google:sql``
**Google App Engine/NoSQL** | ``google:datastore``
-------------
Notice that in SQLite the database consists of a single file. If it does not exist, it is created. This file is locked every time it is accessed. In the case of MySQL, PostgreSQL, MSSQL, FireBird, Oracle, DB2, Ingres and Informix the database "test" must be created outside web2py. Once the connection is established, web2py will create, alter, and drop tables appropriately.
It is also possible to set the connection string to ``None``. In this case DAL will not connect to any back-end database, but the API can still be accessed for testing. Examples of this will be discussed in Chapter 7.
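As a minimal sketch (assuming the standard web2py environment, where ``DAL`` and ``Field`` are already defined), a connection-less DAL still lets you define tables:
``
db = DAL(None)
db.define_table('person', Field('name'))
print db.person.fields
``:code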
#### Connection pooling
``connection pooling``:inxx
The second argument of the DAL constructor is the ``pool_size``; it defaults to 0.
As it is rather slow to establish a new database connection for each request, web2py implements a mechanism for connection pooling. Once a connection is established, the page has been served, and the transaction completed, the connection is not closed but goes into a pool. When the next HTTP request arrives, web2py tries to obtain a connection from the pool and use that for the new transaction. If there are no available connections in the pool, a new connection is established.
The ``pool_size`` parameter is ignored by SQLite and Google App Engine.
Connections in the pools are shared sequentially among threads, in the sense that they may be used by two different but not simultaneous threads. There is only one pool for each web2py process.
When web2py starts, the pool is always empty. The pool grows up to the minimum between the value of ``pool_size`` and the max number of concurrent requests. This means that if ``pool_size=10`` but our server never receives more than 5 concurrent requests, then the actual pool size will only grow to 5. If ``pool_size=0`` then connection pooling is not used.
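For example, here is a sketch of enabling pooling (the credentials and database name are placeholders):
``
db = DAL('postgres://username:password@localhost/test', pool_size=10)
``:code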
For supported back-ends you may also specify whether to check table and field names against database-specific reserved SQL keywords, by passing a ``check_reserved`` argument to the DAL constructor, for example:
``
check_reserved=['postgres', 'postgres_nonreserved']
``:code
The following database backends support reserved words checking.
-----
**PostgreSQL** | ``postgres(_nonreserved)``
**MySQL** | ``mysql``
**FireBird** | ``firebird(_nonreserved)``
**MSSQL** | ``mssql``
**Oracle** | ``oracle``
-----
### ``DAL``, ``Table``, ``Field``
The best way to understand the DAL API is to try each function yourself. This can be done interactively via the web2py shell, although ultimately, DAL code goes in the models and controllers.
Start by creating a connection. For the sake of example, you can use SQLite. Nothing in this discussion changes when you change the back-end engine.
``
>>> db = DAL('sqlite://storage.db')
``:code
The database is now connected and the connection is stored in the global variable ``db``.
At any time you can retrieve the connection string.
``_uri``:inxx
``
>>> print db._uri
sqlite://storage.db
``:code
and the database name
``_dbname``:inxx
``
>>> print db._dbname
sqlite
``:code
Tables can also be defined with an optional ``format`` attribute, which can be a format string such as ``format='%(name)s'``, or even a more complex one using a function:
``
>>> db.define_table('person', Field('name'),
format=lambda r: r.name or 'anonymous')
``:code
The format attribute will be used for two purposes:
- To represent referenced records in select/option drop-downs.
- To set the ``db.othertable.person.represent`` attribute for all fields referencing this table. This means that SQLTABLE will not show references by id but will use the format preferred representation instead.
``Field constructor``:inxx
These are the default values of a Field constructor:
``
Field(name, 'string', length=None, default=None,
required=False, requires='<default>',
ondelete='CASCADE', notnull=False, unique=False,
uploadfield=True, widget=None, label=None, comment=None,
writable=True, readable=True, update=None, authorize=None,
autodelete=False, represent=None, compute=None,
uploadfolder=os.path.join(request.folder,'uploads'),
uploadseparate=None,uploadfs=None)
``:code
Not all of them are relevant for every field. "length" is relevant only for fields of type "string". "uploadfield" and "authorize" are relevant only for fields of type "upload". "ondelete" is relevant only for fields of type "reference" and "upload".
- ``length`` sets the maximum length of a "string", "password" or "upload" field. If ``length`` is not specified a default value is used but the default value is not guaranteed to be backward compatible. ''To avoid unwanted migrations on upgrades, we recommend that you always specify the length for string, password and upload fields.''
- ``default`` sets the default value for the field. The default value is used when performing an insert if a value is not explicitly specified. It is also used to pre-populate forms built from the table using SQLFORM. Note, rather than being a fixed value, the default can instead be a function (including a lambda function) that returns a value of the appropriate type for the field. In that case, the function is called once for each record inserted, even when multiple records are inserted in a single transaction.
- ``required`` tells the DAL that no insert should be allowed on this table if a value for this field is not explicitly specified.
- ``requires`` is a validator or a list of validators. This is not used by the DAL, but it is used by SQLFORM. The default validators for the given types are shown in the following table:
----------
**field type** | **default field validators**
``string`` | ``IS_LENGTH(length)`` default length is 512
``text`` | ``IS_LENGTH(65536)``
``blob`` | ``None``
``boolean`` | ``None``
``integer`` | ``IS_INT_IN_RANGE(-1e100, 1e100)``
``double`` | ``IS_FLOAT_IN_RANGE(-1e100, 1e100)``
``decimal(n,m)`` | ``IS_DECIMAL_IN_RANGE(-1e100, 1e100)``
``date`` | ``IS_DATE()``
``time`` | ``IS_TIME()``
``datetime`` | ``IS_DATETIME()``
``password`` | ``None``
``upload`` | ``None``
``reference <table>`` | ``IS_IN_DB(db,table.field,format)``
``list:string`` | ``None``
``list:integer`` | ``None``
``list:reference <table>`` | ``IS_IN_DB(db,table.field,format,multiple=True)``
``bigint`` | ``None``
``big-id`` | ``None``
``big-reference`` | ``None``
---------
Decimal requires and returns values as ``Decimal`` objects, as defined in the Python ``decimal`` module. SQLite does not handle the ``decimal`` type so internally we treat it as a ``double``. The (n,m) are the number of digits in total and the number of digits after the decimal point respectively.
The ``big-id`` and ``big-reference`` types are only supported by some of the database engines and are experimental. They are not normally used as field types except for legacy tables; however, the DAL constructor has a ``bigint_id`` argument that, when set to ``True``, makes the ``id`` and ``reference`` fields ``big-id`` and ``big-reference`` respectively.
The ``list:`` fields are special because they are designed to take advantage of certain denormalization features on NoSQL (in the case of Google App Engine NoSQL, the field types ``ListProperty`` and ``StringListProperty``) and back-port them to all the other supported relational databases. On relational databases, lists are stored as a ``text`` field. The items are separated by a ``|`` and each ``|`` in a string item is escaped as ``||``. They are discussed in their own section.
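Here is a minimal sketch of such a field in use (the ``product`` table and its ``tags`` field are hypothetical):
``
db.define_table('product', Field('name'), Field('tags', 'list:string'))
id = db.product.insert(name='sofa', tags=['red', 'large'])
rows = db(db.product.tags.contains('red')).select()
``:code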
-------
Notice that ``requires=...`` is enforced at the level of forms, ``required=True`` is enforced at the level of the DAL (insert), while ``notnull``, ``unique`` and ``ondelete`` are enforced at the level of the database. While they sometimes may seem redundant, it is important to maintain the distinction when programming with the DAL.
-------
``ondelete``:inxx
- ``ondelete`` translates into the "ON DELETE" SQL statement. By default it is set to "CASCADE". This tells the database that when it deletes a record, it should also delete all records that refer to it. To disable this feature, set ``ondelete`` to "NO ACTION" or "SET NULL".
- ``notnull=True`` translates into the "NOT NULL" SQL statement. It prevents the database from inserting null values for the field.
- ``unique=True`` translates into the "UNIQUE" SQL statement and it makes sure that values of this field are unique within the table. It is enforced at the database level.
- ``uploadfield`` applies only to fields of type "upload". A field of type "upload" stores the name of a file saved somewhere else, by default on the filesystem under the application "uploads/" folder. If ``uploadfield`` is set, then the file is stored in a blob field within the same table and the value of ``uploadfield`` is the name of the blob field. This will be discussed in more detail later in the context of SQLFORM.
- ``uploadfolder`` defaults to the application's "uploads/" folder. If set to a different path, files will be uploaded to that folder. For example, ``uploadfolder=os.path.join(request.folder,'static/temp')`` will upload files to the web2py/applications/myapp/static/temp folder.
- ``uploadseparate`` if set to True will upload files under different subfolders of the ''uploadfolder'' folder. This is optimized to avoid too many files under the same folder/subfolder. ATTENTION: You cannot change the value of ``uploadseparate`` from True to False without breaking the system. web2py either uses the separate subfolders or it does not. Changing the behavior after files have been uploaded will prevent web2py from being able to retrieve those files. If this happens it is possible to move files and fix the problem but this is not described here.
- ``uploadfs`` allows you to specify a different file system to which to upload files, including Amazon S3 storage or remote FTP storage. This option requires PyFileSystem to be installed. ``uploadfs`` must point to ``PyFileSystem``. ``PyFileSystem``:inxx ``uploadfs``:inxx
- ``widget`` must be one of the available widget objects, including custom widgets, for example: ``SQLFORM.widgets.string.widget``. A list of available widgets will be discussed later. Each field type has a default widget.
- ``label`` is a string (or something that can be serialized to a string) that contains the label to be used for this field in autogenerated forms.
- ``comment`` is a string (or something that can be serialized to a string) that contains a comment associated with this field, and will be displayed to the right of the input field in the autogenerated forms.
- ``writable`` if a field is writable, it can be edited in autogenerated create and update forms.
- ``readable`` if a field is readable, it will be visible in readonly forms. If a field is neither readable nor writable, it will not be displayed in create and update forms.
- ``update`` contains the default value for this field when the record is updated.
- ``compute`` is an optional function. If a record is inserted or updated, the compute function will be executed and the field will be populated with the function result. The record is passed to the compute function as a ``dict``, and the dict will not include the current value of that, or any other compute field.
- ``authorize`` can be used to require access control on the corresponding field, for "upload" fields only. It will be discussed more in detail in the context of Authentication and Authorization.
- ``autodelete`` determines if the corresponding uploaded file should be deleted when the record referencing the file is deleted. For "upload" fields only.
- ``represent`` can be None or can point to a function that takes a field value and returns an alternate representation for the field value. Examples:
``
db.mytable.name.represent = lambda name,row: name.capitalize()
db.mytable.other_id.represent = lambda id,row: row.myfield
db.mytable.some_uploadfield.represent = lambda value,row: \
A('get it', _href=URL('download', args=value))
``:code
``blob``:inxx
"blob" fields are also special. By default, binary data is encoded in base64 before being stored into the actual database field, and it is decoded when extracted. This has the negative effect of using 25% more storage space than necessary in blob fields, but has two advantages. On average it reduces the amount of data communicated between web2py and the database server, and it makes the communication independent of back-end-specific escaping conventions.
The DAL allows you to explicitly issue SQL statements:
``executesql``:inxx
``
>>> print db.executesql('SELECT * FROM person;')
[(1, u'Massimo'), (2, u'Massimo')]
``:code
In this case, the return values are not parsed or transformed by the DAL, and the format depends on the specific database driver. This usage with selects is normally not needed, but it is more common with indexes.
``executesql`` takes two optional arguments: ``placeholders`` and ``as_dict``.
``placeholders`` is an optional sequence of values to be substituted in, or, if supported by the DB driver, a dictionary with keys matching named placeholders in your SQL.
If ``as_dict`` is set to True, the results cursor returned by the DB driver will be converted to a sequence of dictionaries keyed with the db field names. Results returned with ``as_dict=True`` are the same as those returned when applying **.as_list()** to a normal select:
``
[{field1: value1, field2: value2}, {field1: value1b, field2: value2b}]
``:code
``executesql`` also has an optional ``fields`` argument. If not None, the results cursor returned by the DB driver will be converted to a DAL Rows object using the ``db._adapter.parse()`` method. This requires specifying the ``fields`` argument as a list of DAL Field objects that match the fields returned from the DB. The Field objects should be part of one or more Table objects defined on the DAL object.
The ``fields`` list can include one or more DAL Table objects in addition to or instead of Field objects, or it can be just a single table (not in a list). In that case, the Field objects will be extracted from the table(s).
The field names will be extracted from the Field objects, or optionally, a list of field names can be provided (in tablename.fieldname format) via the ``colnames`` argument. Note, the fields and colnames must be in the same order as the fields in the results cursor returned from the DB.
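For example, here is a minimal sketch (assuming the ``person`` table defined earlier in this chapter):
``
rows = db.executesql('SELECT id, name FROM person;', fields=db.person)
for row in rows: print row.name
``:code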
#### ``_lastsql``
Whether SQL was executed manually using executesql or was SQL generated by the DAL, you can always find the SQL code in ``db._lastsql``. This is useful for debugging purposes:
``_lastsql``:inxx
``
>>> rows = db().select(db.person.ALL)
>>> print db._lastsql
SELECT person.id, person.name FROM person;
``:code
-------
web2py never generates queries using the "*" operator. web2py is always explicit when selecting fields.
-------
### ``drop``
Finally, you can drop tables, for example with ``db.mytable.drop()``, and all stored data will be lost.
``drop``:inxx

### Legacy databases and keyed tables

Tables of legacy databases that lack an auto-increment "id" field, but have a primary key composed of one or more fields, can be defined as keyed tables by passing ``primarykey`` to ``define_table``:
``
db.define_table('account',
    Field('accnum','integer'),
    Field('acctype'),
    Field('accdesc'),
    primarykey=['accnum','acctype'],
    migrate=False)
``:code
- ``primarykey`` is a list of the field names that make up the primary key.
- All primarykey fields have a ``NOT NULL`` set even if not specified.
- Keyed tables can only reference other keyed tables.
- Referencing fields must use the ``reference tablename.fieldname`` format.
- The ``update_record`` function is not available for Rows of keyed tables.
-------
Note that currently this is only available for DB2, MS-SQL, Ingres and Informix, but others can be easily added.
-------
At the time of writing, we cannot guarantee that the ``primarykey`` attribute works with every existing legacy table and every supported database backend.
For simplicity, we recommend, if possible, creating a database view that has an auto-increment id field.
### Distributed transaction
``distributed transactions``:inxx
------
At the time of writing this feature is only supported by PostgreSQL, MySQL and Firebird, since they expose an API for two-phase commits.
------
Assuming you have two (or more) connections to distinct PostgreSQL databases, for example:
``
db_a = DAL('postgres://...')
db_b = DAL('postgres://...')
``:code
In your models or controllers, you can commit them concurrently with:
``
DAL.distributed_transaction_commit(db_a, db_b)
``:code
On failure, this function rolls back and raises an ``Exception``.
In controllers, when one action returns, if you have two distinct connections and you do not call the above function, web2py commits them separately. This means there is a possibility that one of the commits succeeds and one fails. The distributed transaction prevents this from happening.
### More on uploads
Consider the following model:
``
>>> db.define_table('myfile',
        Field('image', 'upload', default='path/'))
``:code
In the case of an 'upload' field, the default value can optionally be set to a path (an absolute path or a path relative to the current app folder); the default image will then be set to a copy of the file at that path. A new copy is made for each new record that does not specify an image.
Normally an insert is handled automatically via a SQLFORM or a crud form (which is a SQLFORM) but occasionally you already have the file on the filesystem and want to upload it programmatically. This can be done in this way:
``
>>> stream = open(filename, 'rb')
>>> db.myfile.insert(image=db.myfile.image.store(stream, filename))
``:code
It is also possible to insert a file in a simpler way and have the insert method call ``store`` automatically:
``
>>> stream = open(filename, 'rb')
>>> db.myfile.insert(image=stream)
``:code
In this case the filename is obtained from the stream object, if available.
The ``store`` method of the upload field object takes a file stream and a filename. It uses the filename to determine the extension (type) of the file, creates a new temp name for the file (according to web2py upload mechanism) and loads the file content in this new temp file (under the uploads folder unless specified otherwise). It returns the new temp name, which is then stored in the ``image`` field of the ``db.myfile`` table.
Note, if the file is to be stored in an associated blob field rather than the file system, the ``store()`` method will not insert the file in the blob field (because ``store()`` is called before the insert), so the file must be explicitly inserted into the blob field:
``
>>> db.define_table('myfile',
Field('image', 'upload', uploadfield='image_file'),
Field('image_file', 'blob'))
>>> stream = open(filename, 'rb')
>>> db.myfile.insert(image=db.myfile.image.store(stream, filename),
image_file=stream.read())
``:code
The opposite of ``.store`` is ``.retrieve``:
``
>>> row = db(db.myfile).select().first()
>>> (filename, stream) = db.myfile.image.retrieve(row.image)
>>> import shutil
>>> shutil.copyfileobj(stream,open(filename,'wb'))
``:code
### ``Query``, ``Set``, ``Rows``
Let's consider again the table defined (and dropped) previously and insert three records:
``
>>> db.define_table('person', Field('name'))
>>> db.person.insert(name="Alex")
1
>>> db.person.insert(name="Bob")
2
>>> db.person.insert(name="Carl")
3
``:code
You can update a record with ``db.mytable[id] = dict(myfield='somevalue')``, which is equivalent to
``
db(db.mytable.id==id).update(myfield='somevalue')
``:code
and it updates an existing record with the field values specified by the dictionary on the right hand side.
#### Fetching a ``Row``
Yet another convenient syntax is the following:
``
record = db.mytable(id)
record = db.mytable(db.mytable.id==id)
record = db.mytable(id,myfield='somevalue')
``:code
While apparently similar to ``db.mytable[id]``, the above syntax is more flexible and safer. First of all it checks whether ``id`` is an int (or ``str(id)`` is an int) and returns ``None`` if not (it never raises an exception). It also allows you to specify multiple conditions that the record must meet. If they are not met, it also returns ``None``.
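For example, with the ``person`` table (a sketch; the id value is arbitrary):
``
person = db.person(1)               # the Row with id 1, or None
person = db.person(1, name='Alex')  # None unless the name also matches
``:code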
#### Recursive ``select``s
``recursive selects``:inxx
Consider the previous table person and a new table "thing" referencing a "person":
``
>>> db.define_table('thing', Field('name'), Field('owner','reference person'))
``:code
and a simple select from this table:
``
>>> things = db(db.thing).select()
``:code
which is equivalent to
``
>>> things = db(db.thing._id>0).select()
``:code
where ``._id`` is a reference to the primary key of the table. Normally ``db.thing._id`` is the same as ``db.thing.id`` and we will assume that in most of this book. ``_id``:inxx
For each Row of things it is possible to fetch not just fields from the selected table (thing) but also from linked tables (recursively):
``
>>> for thing in things: print thing.name, thing.owner.name
``:code
Here ``thing.owner.name`` requires one database select for each thing in things, and it is therefore inefficient. We suggest using joins whenever possible instead of recursive selects; nevertheless, this is convenient and practical when accessing individual records.
You can also do it backwards, by selecting the things referenced by a person:
``
person = db.person(id)
for thing in person.thing.select(orderby=db.thing.name):
    print person.name, 'owns', thing.name
``:code
In this last expression ``person.thing`` is a shortcut for
``
db(db.thing.owner==person.id)
``:code
i.e. the Set of ``thing``s referenced by the current ``person``. This syntax breaks down if the referencing table has multiple references to the referenced table. In this case one needs to be more explicit and use a full Query.
#### Serializing ``Rows`` in views
Given the following action containing a query
``SQLTABLE``:inxx
``
def index():
return dict(rows = db(query).select())
``:code
The result of a select can be displayed in a view with the following syntax:
``
{{extend 'layout.html'}}
<h1>Records</h1>
{{=rows}}
``:code
Which is equivalent to ``{{=SQLTABLE(rows)}}``, since serializing a Rows object in a view calls ``SQLTABLE`` automatically.
Due to Python restrictions in overloading the "``and``" and "``or``" operators, these cannot be used for forming queries; the binary operators "``&``" and "``|``" must be used instead.
It is also possible to build queries using in-place logical operators:
``
>>> query = db.person.name!='Alex'
>>> query &= db.person.id>3
>>> query |= db.person.name=='John'
``:code
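The composed query can then be used as usual, for example:
``
>>> rows = db(query).select(db.person.ALL)
``:code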
#### ``count``, ``isempty``, ``delete``, ``update``
You can count records in a set:
``count``:inxx ``isempty``:inxx
``
>>> print db(db.person.id > 0).count()
3
``:code
Notice that ``count`` takes an optional ``distinct`` argument which defaults to False, and it works very much like the same argument for ``select``. ``count`` also has a ``cache`` argument that works very much like the equivalent argument of the ``select`` method.
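A sketch of both options (assuming the usual web2py ``cache`` object and its ``(cache_model, seconds)`` convention):
``
>>> print db(db.person.id > 0).count(distinct=db.person.name)
>>> print db(db.person.id > 0).count(cache=(cache.ram, 3600))
``:code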
Sometimes you may need to check whether a table is empty. A more efficient way than counting is using the ``isempty`` method:
``
>>> print db(db.person.id > 0).isempty()
False
``:code
or equivalently:
``
>>> print db(db.person).isempty()
False
``:code
You can delete records in a set:
``delete``:inxx
``
>>> db(db.person.id > 3).delete()
``:code
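The lines below refer to a lazy virtual field on an ``item`` table; a minimal sketch of such a definition (the ``item`` table and its ``unit_price`` and ``quantity`` fields are assumptions) is:
``
>>> db.define_table('item',
        Field('unit_price', 'double'),
        Field('quantity', 'integer'))
>>> # a lazy field: computed on demand, with an optional discount percentage
>>> db.item.total_price = Field.Lazy(
        lambda row, discount=0:
            row.item.unit_price * row.item.quantity * (100 - discount) / 100)
``:code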
In this case ``row.total_price`` is not a value but a function. The function takes the same arguments as the function used to define the lazy field, except for the row, which is passed implicitly.
The lazy field in the example above allows one to compute the total price for each ``item``:
``
>>> for row in db(db.item).select(): print row.total_price()
``:code
And it also allows to pass an optional ``discount`` percentage (15%):
``
>>> for row in db(db.item).select(): print row.total_price(15)
``:code
------
Mind that virtual fields do not have the same attributes as other fields (``default``, ``readable``, ``requires``, etc.), they do not appear in the list of ``db.table.fields``, and they are not visualized by default in tables (TABLE) and grids (SQLFORM.grid, SQLFORM.smartgrid).
------
### One to many relation
``one to many``:inxx
To illustrate how to implement one to many relations with the web2py DAL, define another table "thing" that refers to the table "person" which we redefine here:
``
>>> db.define_table('person',
Field('name'),
format='%(name)s')
>>> db.define_table('thing',
Field('name'),
Field('owner', 'reference person'),
format='%(name)s')
``:code
Table "thing" has two fields, the name of the thing and the owner of the thing. When a field type is another table, it is intended that the field reference the other table by its id. In fact, you can print the actual type value and get:
``
>>> print db.thing.owner.type
reference person
``:code
Now, insert three things, two owned by Alex and one by Bob:
``
>>> db.thing.insert(name='Boat', owner=1)
1
>>> db.thing.insert(name='Chair', owner=1)
2
>>> db.thing.insert(name='Shoes', owner=2)
3
``:code
You can select as you did for any other table:
``
>>> for row in db(db.thing.owner==1).select():
        print row.name
Boat
Chair
``:code
Because a thing has a reference to a person, a person can have many things, so a record of table person now acquires a new attribute thing, which is a Set, that defines the things of that person. This allows looping over all persons and fetching their things easily:
``referencing``:inxx
``
>>> for person in db().select(db.person.ALL):
        print person.name
        for thing in person.thing.select():
            print ' ', thing.name
Alex
  Boat
  Chair
Bob
  Shoes
Carl
``:code
#### Inner joins
Another way to achieve a similar result is by using a join, specifically an INNER JOIN. web2py performs joins automatically and transparently when the query links two or more tables as in the following example:
``Rows``:inxx ``inner join``:inxx ``join``:inxx
``
>>> rows = db(db.person.id==db.thing.owner).select()
>>> for row in rows:
        print row.person.name, 'has', row.thing.name
Alex has Boat
Alex has Chair
Bob has Shoes
``:code
Observe that web2py did a join, so the rows now contain two records, one from each table, linked together. Because the two records may have fields with conflicting names, you need to specify the table when extracting a field value from a row. This means that while before you could do:
``
row.name
``:code
and it was obvious whether this was the name of a person or a thing, in the result of a join you have to be more explicit and say:
``
row.person.name
``:code
or:
``
row.thing.name
``:code
There is an alternative syntax for INNER JOINs:
``
>>> rows = db(db.person).select(join=db.thing.on(db.person.id==db.thing.owner))
>>> for row in rows:
        print row.person.name, 'has', row.thing.name
Alex has Boat
Alex has Chair
Bob has Shoes
``:code
While the output is the same, the generated SQL in the two cases can be different. The latter syntax removes possible ambiguities when the same table is joined twice and aliased:
``
>>> db.define_table('thing',
Field('name'),
Field('owner1','reference person'),
Field('owner2','reference person'))
>>> rows = db(db.person).select(
        join=[db.person.with_alias('owner1').on(db.person.id==db.thing.owner1),
              db.person.with_alias('owner2').on(db.person.id==db.thing.owner2)])
``:code
The value of ``join`` can be a list of ``db.table.on(...)`` expressions to join.
#### Left outer join
Notice that Carl did not appear in the list above because he has no things. If you intend to select on persons (whether they have things or not) and their things (if they have any), then you need to perform a LEFT OUTER JOIN. This is done using the argument "left" of the select command. Here is an example:
``Rows``:inxx ``left outer join``:inxx ``outer join``:inxx
``
>>> rows = db().select(
        db.person.ALL, db.thing.ALL,
        left=db.thing.on(db.person.id==db.thing.owner))
>>> for row in rows:
        print row.person.name, 'has', row.thing.name
Alex has Boat
Alex has Chair
Bob has Shoes
Carl has None
``:code
where:
``
left = db.thing.on(...)
``:code
does the left join query. Here the argument of ``db.thing.on`` is the condition required for the join (the same used above for the inner join). In the case of a left join, it is necessary to be explicit about which fields to select.
Multiple left joins can be combined by passing a list or tuple of ``db.mytable.on(...)`` to the ``left`` attribute.
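For example, the left join above can equivalently be passed as a one-element list, to which further ``on`` clauses may be appended:
``
>>> rows = db().select(db.person.ALL, db.thing.ALL,
        left=[db.thing.on(db.person.id==db.thing.owner)])
``:code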
#### Grouping and counting
When doing joins, sometimes you want to group rows according to certain criteria and count them. For example, count the number of things owned by every person. web2py allows this as well. First, you need a count operator. Second, you want to join the person table with the thing table by owner. Third, you want to select all rows (person + thing), group them by person, and count them while grouping:
``grouping``:inxx
``
>>> count = db.person.id.count()
>>> for row in db(db.person.id==db.thing.owner).select(
db.person.name, count, groupby=db.person.name):
print row.person.name, row[count]
Alex 2
Bob 1
``:code
Notice the count operator (which is built-in) is used as a field. The only issue here is in how to retrieve the information. Each row clearly contains a person and the count, but the count is not a field of a person nor is it a table. So where does it go? It goes into the storage object representing the record with a key equal to the query expression itself. The count method of the Field object has an optional ``distinct`` argument. When set to ``True`` it specifies that only distinct values of the field in question are to be counted.
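For example, here is a sketch that counts only distinct thing names per person:
``
>>> count = db.thing.name.count(distinct=True)
>>> for row in db(db.person.id==db.thing.owner).select(
        db.person.name, count, groupby=db.person.name):
        print row.person.name, row[count]
``:code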
### Many to many
``many-to-many``:inxx
In the previous examples, we allowed a thing to have one owner but one person could have many things. What if Boat was owned by Alex and Curt? This requires a many-to-many relation, and it is realized via an intermediate table that links a person to a thing via an ownership relation.
Here is how to do it:
``
>>> db.define_table('person',
Field('name'))
>>> db.define_table('thing',
Field('name'))
>>> db.define_table('ownership',
Field('person', 'reference person'),
Field('thing', 'reference thing'))
``:code
The existing ownership relations can now be rewritten as:
``
>>> db.ownership.insert(person=1, thing=1) # Alex owns Boat
>>> db.ownership.insert(person=1, thing=2) # Alex owns Chair
>>> db.ownership.insert(person=2, thing=3) # Bob owns Shoes
``:code
Now you can add the new relation that Curt co-owns Boat:
``
>>> db.ownership.insert(person=3, thing=1) # Curt owns Boat too
``:code
Because you now have a three-way relation between tables, it may be convenient to define a new set on which to perform operations:
``
>>> persons_and_things = db(
(db.person.id==db.ownership.person) \
& (db.thing.id==db.ownership.thing))
``:code
Now it is easy to select all persons and their things from the new Set:
``
>>> for row in persons_and_things.select():
        print row.person.name, row.thing.name
Alex Boat
Alex Chair
Bob Shoes
Curt Boat
``:code
Similarly, you can search for all things owned by Alex:
``
>>> for row in persons_and_things(db.person.name=='Alex').select():
        print row.thing.name
Boat
Chair
``:code
and all owners of Boat:
``
>>> for row in persons_and_things(db.thing.name=='Boat').select():
print row.person.name
Alex
Curt
``:code
A lighter alternative to many-to-many relations is tagging. Tagging is discussed in the context of the ``IS_IN_DB`` validator. Tagging works even on database backends that do not support JOINs, like the Google App Engine NoSQL.
### Many to many, ``list:<type>``, and ``contains``
``list:string``:inxx
``list:integer``:inxx
``list:reference``:inxx
``contains``:inxx
``multiple``:inxx
``tags``:inxx
web2py provides the following special field types:
``
list:string
list:integer
list:reference <table>
``:code
Let's define another table "log" to store security events, their event_time and severity:
``
>>> db.define_table('log', Field('event'),
        Field('event_time', 'datetime'),
        Field('severity', 'integer'))
``:code
As before, insert a few events, a "port scan", an "xss injection" and an "unauthorized login".
For the sake of the example, you can log events with the same event_time but with different severities (1, 2, 3 respectively).
``
>>> import datetime
>>> now = datetime.datetime.now()
>>> print db.log.insert(
event='port scan', event_time=now, severity=1)
1
>>> print db.log.insert(
event='xss injection', event_time=now, severity=2)
2
>>> print db.log.insert(
event='unauthorized login', event_time=now, severity=3)
3
``:code
#### ``like``, ``regexp``, ``startswith``, ``contains``, ``upper``, ``lower``
``like``:inxx ``startswith``:inxx ``regexp``:inxx ``contains``:inxx ``upper``:inxx ``lower``:inxx
Fields have a like operator that you can use to match strings:
``
>>> for row in db(db.log.event.like('port%')).select():
print row.event
port scan
``:code
Here "port%" indicates a string starting with "port". The percent sign character, "%", is a wild-card character that means "any sequence of characters".
The ``like`` operator is case-insensitive, but it can be made case-sensitive with:
``
db.mytable.myfield.like('value', case_sensitive=True)
``:code
web2py also provides some shortcuts:
``
db.mytable.myfield.startswith('value')
db.mytable.myfield.contains('value')
``:code
which are equivalent respectively to
``
db.mytable.myfield.like('value%')
db.mytable.myfield.like('%value%')
``:code
Notice that ``contains`` has a special meaning for ``list:<type>`` fields and it was discussed in a previous section.
The ``contains`` method can also be passed a list of values and an optional boolean argument ``all`` to search for records that contain all values:
``
db.mytable.myfield.contains(['value1','value2'], all=True)
``:code
or any value from the list:
``
db.mytable.myfield.contains(['value1','value2'], all=False)
``:code
There is also a ``regexp`` method that works like the ``like`` method but allows regular expression syntax for the look-up expression. It is only supported by PostgreSQL and SQLite.
The ``upper`` and ``lower`` methods allow you to convert the value of the field to upper or lower case, and you can also combine them with the like operator:
``upper``:inxx ``lower``:inxx
``
>>> for row in db(db.log.event.upper().like('PORT%')).select():
print row.event
port scan
``:code
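``lower`` works symmetrically, for example:
``
>>> for row in db(db.log.event.lower().like('%scan')).select():
        print row.event
port scan
``:code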
#### ``year``, ``month``, ``day``, ``hour``, ``minutes``, ``seconds``
``hour``:inxx ``minutes``:inxx ``seconds``:inxx ``day``:inxx ``month``:inxx ``year``:inxx
The date and datetime fields have day, month and year methods. The datetime and time fields have hour, minutes and seconds methods. Here is an example:
``
>>> for row in db(db.log.event_time.year()==2009).select():
print row.event
port scan
xss injection
``:code

#### ``belongs``
The SQL IN operator is realized via the ``belongs`` method, which returns true when the value of the field belongs to the specified set (list or tuple):
``belongs``:inxx
``
>>> for row in db(db.log.severity.belongs((1, 2))).select():
print row.event
port scan
xss injection
``:code
The DAL also allows a nested select as the argument of the belongs operator. The only caveat is that the nested select has to be a ``_select``, not a ``select``, and only one field has to be selected explicitly, the one that defines the set.
``nested select``:inxx
``
>>> bad_days = db(db.log.severity==3)._select(db.log.event_time)
>>> for row in db(db.log.event_time.belongs(bad_days)).select():
print row.event
port scan
xss injection
unauthorized login
``:code
In those cases where a nested select is required and the look-up field is a reference, we can also use a query as the argument. For example:
``
db.define_table('person', Field('name'))
db.define_table('thing', Field('name'), Field('owner', 'reference person'))
db(db.thing.owner.belongs(db.person.name=='Johnathan')).select()
``:code
In this case it is obvious that the nested select only needs the field referenced by the ``db.thing.owner`` field, so we do not need the more verbose ``_select`` notation.

#### ``sum``, ``avg``, ``min``, ``max`` and ``len``
``sum``:inxx ``avg``:inxx ``min``:inxx ``max``:inxx
Previously, you have used the count operator to count records. Similarly, you can use the sum operator to add (sum) the values of a specific field from a group of records. As in the case of count, the result of a sum is retrieved via the storage object:
``
>>> sum = db.log.severity.sum()
>>> print db().select(sum).first()[sum]
6
``:code
You can also use ``avg``, ``min``, and ``max`` to get the average, minimum, and maximum value respectively for the selected records. For example:
``
>>> max = db.log.severity.max()
>>> print db().select(max).first()[max]
3
``:code
``.len()`` computes the length of string, text, and boolean fields.
Expressions can be combined to form more complex expressions. For example, here we are computing the sum of the lengths of all the severity strings in the logs, increased by one:
``
>>> sum = (db.log.severity.len()+1).sum()
>>> print db().select(sum).first()[sum]
``:code
#### Substrings
One can build an expression to refer to a substring. For example, we can group things whose name starts with the same three characters and select only one from each group:
``
db(db.thing).select(distinct=db.thing.name[:3])
``:code
#### Default values with ``coalesce`` and ``coalesce_zero``
There are times when you need to pull a value from the database but also need a default value if the value for a record is set to NULL. In SQL there is a keyword, ``COALESCE``, for this. web2py has an equivalent ``coalesce`` method:
``
>>> db.define_table('sysuser',Field('username'),Field('fullname'))
>>> db.sysuser.insert(username='max',fullname='Max Power')
>>> db.sysuser.insert(username='tim',fullname=None)
>>> print db(db.sysuser).select(db.sysuser.fullname.coalesce(db.sysuser.username))
"COALESCE(sysuser.fullname,sysuser.username)"
Max Power
tim
``:code
Other times you need to compute a mathematical expression, but some fields have a value set to None when it should be zero.
``coalesce_zero`` comes to the rescue by defaulting None to zero in the query.
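Here is a minimal sketch (the ``sysaccount`` table and its values are hypothetical):
``
>>> db.define_table('sysaccount', Field('balance', 'integer'))
>>> db.sysaccount.insert(balance=2500)
>>> db.sysaccount.insert(balance=None)
>>> total = db.sysaccount.balance.coalesce_zero().sum()
>>> print db(db.sysaccount).select(total).first()[total]
2500
``:code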
And finally, here is ``_update``: ``_update``:inxx
``
>>> print db(db.person.name=='Alex')._update()
UPDATE person SET  WHERE person.name='Alex';
``:code
-----
Moreover you can always use ``db._lastsql`` to return the most recent SQL code, whether it was executed manually using executesql or was SQL generated by the DAL.
-----
### Exporting and importing data
``export``:inxx ``import``:inxx
#### CSV (one Table at a time)
When a DALRows object is converted to a string it is automatically serialized in CSV:
``csv``:inxx
``
>>> rows = db(db.person.id==db.thing.owner).select()
>>> print rows
person.id,person.name,thing.id,thing.name,thing.owner
1,Alex,1,Boat,1
1,Alex,2,Chair,1
2,Bob,3,Shoes,2
``:code
You can serialize a single table in CSV and store it in a file "test.csv":
``
>>> open('test.csv', 'w').write(str(db(db.person.id).select()))
``:code
and you can easily read it back with:
``
>>> db.person.import_from_csv_file(open('test.csv', 'r'))
``:code
When importing, web2py looks for the field names in the CSV header. In this example, it finds two columns: "person.id" and "person.name". It ignores the "person." prefix, and it ignores the "id" fields. Then all records are appended and assigned new ids. Both of these operations can be performed via the appadmin web interface.
#### CSV (all tables at once)
In web2py, you can backup/restore an entire database with two commands:
To export:
``
db.export_to_csv_file(open('somefile.csv', 'wb'))
``:code
To import:
``
db.import_from_csv_file(open('somefile.csv', 'rb'))
``:code
Two tables are separated by ``\r\n\r\n``. The file ends with the line:
``
END
``:code
The file does not include uploaded files if these are not stored in the database. In any case it is easy enough to zip the "uploads" folder separately.
When importing, the new records will be appended to the database if it is not empty. In general the new imported records will not have the same record id as the original (saved) records but web2py will restore references so they are not broken, even if the id values may change.
If a table contains a field called "uuid", this field will be used to identify duplicates. Also, if an imported record has the same "uuid" as an existing record, the previous record will be updated.
#### CSV and remote database synchronization
Consider the following model:
``
db = DAL('sqlite:memory:')
db.define_table('person',
Field('name'),
format='%(name)s')
db.define_table('thing',
Field('owner', 'reference person'),
Field('name'),
format='%(name)s')
if not db(db.person).count():
id = db.person.insert(name="Massimo")
db.thing.insert(owner=id, name="Chair")
``:code
Each record is identified by an ID and referenced by that ID. If you have two copies of the database used by distinct web2py installations, the ID is unique only within each database and not across the databases. This is a problem when merging records from different databases.
In order to make records uniquely identifiable across databases, they must:
- have a unique id (UUID),
- have an event_time (to figure out which one is more recent if multiple copies),
- reference the UUID instead of the id.
This can be achieved without modifying web2py. Here is what to do:
**1.** Change the above model into:
``
db.define_table('person',
Field('uuid', length=64, default=lambda:str(uuid.uuid4())),
Field('modified_on', 'datetime', default=now),
Field('name'),
format='%(name)s')
db.define_table('thing',
Field('uuid', length=64, default=lambda:str(uuid.uuid4())),
Field('modified_on', 'datetime', default=now),
Field('owner', length=64),
Field('name'),
format='%(name)s')
db.thing.owner.requires = IS_IN_DB(db,'person.uuid','%(name)s')
if not db(db.person.id).count():
id = uuid.uuid4()
db.person.insert(name="Massimo", uuid=id)
db.thing.insert(owner=id, name="Chair")
``:code
-------
Note, in the above table definitions, the default value for the two 'uuid' fields is set to a lambda function, which returns a UUID (converted to a string). The lambda function is called once for each record inserted, ensuring that each record gets a unique UUID, even if multiple records are inserted in a single transaction.
-------
**2.** Create a controller action to export the database:
``
import StringIO  # the in-memory buffer used to build the CSV response
def export():
    s = StringIO.StringIO()
db.export_to_csv_file(s)
response.headers['Content-Type'] = 'text/csv'
return s.getvalue()
``:code
**3.** Create a controller action to import a saved copy of the other database and sync records:
``
def import_and_sync():
    form = FORM(INPUT(_type='file', _name='data'), INPUT(_type='submit'))
    if form.process().accepted:
        db.import_from_csv_file(form.vars.data.file, unique=False)
        # for every table, for every uuid, delete all but the latest record
        for table in db.tables:
            items = db(db[table]).select(db[table].id, db[table].uuid,
                                         orderby=db[table].modified_on,
                                         groupby=db[table].uuid)
            for item in items:
                db((db[table].uuid==item.uuid)&(db[table].id!=item.id)).delete()
    return dict(form=form)
``:code
Notice that steps 2 and 3 work for every database model; they are not specific for this example.
``XML-RPC``:inxx
Alternatively, you can use XML-RPC to export/import the file.
If the records reference uploaded files, you also need to export/import the content of the uploads folder. Notice that files therein are already labeled by UUIDs so you do not need to worry about naming conflicts and references.
#### HTML and XML (one Table at a time)
``DALRows objects``:inxx
DALRows objects also have an ``xml`` method (like helpers) that serializes them to XML/HTML:
``HTML``:inxx
``
>>> rows = db(db.person.id==db.thing.owner).select()
>>> print rows.xml()
<table>
<thead>
<tr>
<th>person.id</th>
<th>person.name</th>
<th>thing.id</th>
<th>thing.name</th>
<th>thing.owner</th>
</tr>
</thead>
<tbody>
<tr class="even">
<td>1</td>
<td>Alex</td>
<td>1</td>
<td>Boat</td>
<td>1</td>
</tr>
...
</tbody>
</table>
``:code