Table 6.2
Amines and Acids from nci.structure Tables Selected in Groups of 8 Amines
and 12 Acids. Only three acids are shown in this truncated table.

Amines                              Acids
CC(C)(C(=O)O)NC(=O)N(CCCl)N=O       c1cc(oc1)C(=O)C(=O)O
c1cc(ccc1C(=O)NC2(CC2)N3CCOCC3)Cl   c1cc(oc1)C(=O)C(=O)O
CCC(C)(NC)P(=O)(OCC)OCC             c1cc(oc1)C(=O)C(=O)O
Cc1ccccc1N=C(C(=C)Cl)NC(C)(C)C      c1cc(oc1)C(=O)C(=O)O
CNC1(CCCCC1)C(=O)O                  c1cc(oc1)C(=O)C(=O)O
CCOC(=O)NC(C)(C)CO                  c1cc(oc1)C(=O)C(=O)O
CC(C)(C)NCCS                        c1cc(oc1)C(=O)C(=O)O
CC(C)C(C(=O)OC)(NC(=O)C)S           c1cc(oc1)C(=O)C(=O)O
CC(C)(C(=O)O)NC(=O)N(CCCl)N=O       CC(CCC(=O)O)N
c1cc(ccc1C(=O)NC2(CC2)N3CCOCC3)Cl   CC(CCC(=O)O)N
CCC(C)(NC)P(=O)(OCC)OCC             CC(CCC(=O)O)N
Cc1ccccc1N=C(C(=C)Cl)NC(C)(C)C      CC(CCC(=O)O)N
CNC1(CCCCC1)C(=O)O                  CC(CCC(=O)O)N
CCOC(=O)NC(C)(C)CO                  CC(CCC(=O)O)N
CC(C)(C)NCCS                        CC(CCC(=O)O)N
CC(C)C(C(=O)OC)(NC(=O)C)S           CC(CCC(=O)O)N
CC(C)(C(=O)O)NC(=O)N(CCCl)N=O       CCC1C2CCC3=CC(=O)CCC3C2CCC1(C)C(=O)O
c1cc(ccc1C(=O)NC2(CC2)N3CCOCC3)Cl   CCC1C2CCC3=CC(=O)CCC3C2CCC1(C)C(=O)O
CCC(C)(NC)P(=O)(OCC)OCC             CCC1C2CCC3=CC(=O)CCC3C2CCC1(C)C(=O)O

Select logp From properties Where md5(logp) > md5(logp+1);
Select logp From properties Where md5(logp) < md5(logp+1);
Here, md5 is a hash function available in PostgreSQL. It is used to
partition the logp values in the properties table into two arbitrary sets
of about the same size. Each row falls into exactly one of the two sets,
depending on which of its two hashes compares as smaller, and the use of
the md5 function ensures that the sets are arbitrary and of about the same
size. Note that using md5 results in arbitrary but not random sets. In
other words, each time the select statements above are run, exactly the
same sets will result, as long as no new rows are inserted. Rather than
using these SQL statements every time the test set is desired, a test_set
view and a training_set view can be defined as:
Create View test_set As Select smiles, logp From properties
Where md5(logp) > md5(logp+1);
Create View training_set As Select smiles, logp From properties
Where md5(logp) < md5(logp+1);
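
The hash-based split can also be tried outside the database. The following
is a minimal sketch in Python, not the author's code: it assumes the hash
is taken over the text form of each number, uses made-up logp values, and
the helper names md5_hex, in_test_set and split are hypothetical. Because
number formatting and string comparison differ between Python and
PostgreSQL, the sets it produces will not match those of the views; it only
illustrates that the split is repeatable and roughly even. Depending on the
PostgreSQL version, the md5 call in the views may itself need an explicit
cast such as md5(logp::text).

```python
import hashlib
from decimal import Decimal


def md5_hex(value: Decimal) -> str:
    """Return the hex MD5 digest of the number's text form."""
    return hashlib.md5(str(value).encode("utf-8")).hexdigest()


def in_test_set(logp: Decimal) -> bool:
    """Same comparison as the test_set view: md5(logp) > md5(logp+1)."""
    return md5_hex(logp) > md5_hex(logp + 1)


def split(values):
    """Partition values into (test, training) by comparing the two hashes."""
    test = [v for v in values if in_test_set(v)]
    training = [v for v in values if not in_test_set(v)]
    return test, training


# Hypothetical logp values from -2.00 to 7.99, made up for illustration.
logp_values = [Decimal(i) / 100 for i in range(-200, 800)]

test, training = split(logp_values)
print(len(test), len(training))  # two sets of roughly equal size
print(split(logp_values) == (test, training))  # True: same sets on every run
```

Every value lands in exactly one of the two lists, and rerunning the split
gives the same lists, which is the "arbitrary but not random" behavior
described above.
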
The views test_set and training_set can now be used as if they were actual
tables. If there are other criteria desired to define a test set or