Presentation is loading. Please wait.

Presentation is loading. Please wait.

Structure Searching with STN Express

Similar presentations


Presentation on theme: "Structure Searching with STN Express"— Presentation transcript:

1 Structure Searching with STN Express

2 Introduction The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers discussing substances retrieved by a structure search are stored in the CAplus database. The L-number generated in the full-file structure search is the key to locating relevant references.

3 Why Do a Structure Search?
Introduction Why Do a Structure Search? There are a number of ways to locate information about a chemical compound: Chemical name Molecular formula Etc. Structure Each access point has unique advantages. Dictionary

4 Introduction Dictionary
Searching by dictionary is a powerful method for retrieving at least broad classes of substances in Registry, useful also in structure searching.

5 Introduction Dictionary - Chemical name
Chemical names may be systematic or trade names. However, systematic names can be complex and difficult to derive: Benzeneacetonitrile, α-[3-[[2-(3,4-dimethoxyphenyl)ethyl] methylamino]propyl]-3,4-dimethoxy-α-(1-methylethyl)-

6 Introduction Dictionary - Molecular formula
Many compounds have the same molecular formula: C27H38N2O4 is shared by >250 compounds in the CAS REGISTRY database

7 Introduction Dictionary - Chemical name, Molecular formula
Name, and Molecular formula, fragments, are inside the BI.

8 Introduction Structure Each compound has an unique structure

9 Structure-Searchable STN Databases
Introduction Structure-Searchable STN Databases A number of STN databases are searchable by structure. The REGISTRY database, produced by CAS, is the world's largest collection of chemical compounds.

10 Introduction A sample of structure-searchable databases on STN:

11 Basic Strategy for Structure Searching
Introduction Basic Strategy for Structure Searching Locating substance information via structure searching is accomplished in four main steps: Draw the structure Search the structure Display the structure matches retrieved Locate the desired information about the structure matches of interest

12 Structure Drawing

13 Structure Drawing Accessing the Structure Drawing Screen
The structure drawing screen is accessed via the STN Express main tool bar via the prepare query icon:

14 Structure Drawing Structure drawing screen has 7 main parts:

15 Structure Drawing

16 Structure Drawing Most chemical structures can be drawn using the following techniques: Drawing chains Drawing rings and ring systems Connecting rings and chains Specifying atoms Specifying shortcuts Specifying bonds Erasing and undoing

17 Structure Drawing When structures are drawn using STN Express tools, the following defaults apply: Atoms in the structure match the element in the current atom box. When the structure drawing screen is opened, the current atom is set to C (carbon). Bonds in the structure match the bond in the current bond box. When the structure drawing screen is opened, the current bond is set to single. PREFERENCES

18 Structure Drawing Drawing Chains

19 Drawing Rings and Ring Systems
Structure Drawing Drawing Rings and Ring Systems (continued on next slide)

20 Drawing Rings and Ring Systems
Structure Drawing Drawing Rings and Ring Systems

21 Drawing Rings and Ring Systems
Structure Drawing Drawing Rings and Ring Systems

22 Drawing Rings and Ring Systems
Structure Drawing Drawing Rings and Ring Systems

23 Structure Drawing

24 Connecting Rings and Chains
Structure Drawing Connecting Rings and Chains

25 Connecting Rings and Chains
Structure Drawing Connecting Rings and Chains

26 Structure Drawing Specifying Atoms
When structures are drawn with the pencil tool, ring tool, or chain tool, the atoms in the structure are whatever element appears in the current atom box. Atoms may be changed.

27 Structure Drawing Specifying Atoms

28 Structure Drawing Specifying Atoms

29 Structure Drawing Tips Inserting an Atom
When inserting an atom in a structure, look for an "A" to appear in the body of the pencil tool, indicating that the pencil tip is in correct position, directly over an atom. Illustration:

30 Structure Drawing Specifying Shortcuts
When structures are drawn with the pencil tool, ring tool, or chain tool, the atoms in the structure are whatever element appears in the current atom box. Atoms may be changed to any of the pre-drawn shortcuts.

31 Structure Drawing Specifying Shortcuts

32 Structure Drawing Specifying Bonds
When structures are drawn with the pencil tool, ring tool, or chain tool, the atoms in the structure are whatever element appears in the current atom box. Bonds may be changed.

33 Structure Drawing Specifying Bonds

34 Structure Drawing Specifying Bonds

35 Structure Drawing Correcting Mistakes

36 Structure Drawing Tips Resetting Default Settings for Atoms/Bonds
When "multiple use" is selected for an atom or shortcut, the default atom is changed to that selection and will remain in effect until defaults are reset by clicking the default atom/bond icon:

37 Structure Drawing Tips Changing atoms
Atoms can be changed by doing the following: Click the selection tool. Click to highlight the atoms to be changed (shift). Select the new atoms.

38 Displaying Carbon Atoms
Structure Drawing Displaying Carbon Atoms Carbon atoms in a structure can be displayed as To toggle between options, click the carbon display icon:

39 Structure Drawing After a structure is built, it must be saved prior to using it in an STN structure search

40 Structure Drawing Saving the Structure

41 Structure Drawing Saving the Structure X

42 Structure Drawing Verify the Structure:

43 Skills Practice Draw and save each structure below, using the techniques just discussed.

44 Structure Search Strategy

45 Structure Search Strategy
In this section, you will learn how to Determine the type of structure search to do, based on your information needs Upload a structure to STN Run a structure search in REGISTRY and display structure matches Locate CAplus references to structures of interest

46 Structure Search Strategy
Preview Structure searches can be run to locate Exact matches of a structure Structures containing a structural skeleton or fragment of interest Literature references discussing structures of interest The steps involved in all structure searches follow a similar order.

47 Structure Search Strategy
Preview To locate research papers discussing a structure of interest

48 Structure Search Strategy
Search Question: Locate references discussing the preparation of the following substance:

49 Structure Search Strategy
Step 1: Draw and save the structure Step 2: Logon to STN and enter a structure-searchable database (The FILE command is used to enter a structure-searchable database) FILE 'HOME' ENTERED => FILE REGISTRY

50 Structure Search Strategy
Step 3: Upload the structure

51 Structure Search Strategy
Step 3: Upload the structure

52 Structure Search Strategy
Step 4: Run a sample structure search The perspective of the search question is to locate exact matches on a given structure. A sample structure search searches a portion of the database. It is a no-cost option that is used to evaluate the effectiveness of a structure search by Testing the structure search to ensure it will run within system limits Verifying that the types of answers retrieved are the types of answers desired

53 Structure Search Strategy
The SEARCH command is used to run a sample structure search. At the command line, 3 additional pieces of information are required: A) L-number assigned to the structure during uploading

54 Structure Search Strategy
Type of structure search Three types of structure searches are possible, depending on the types of substance matches desired: Variability in bonds is allowed in all type of searches

55 Structure Search Strategy
Scope of the structure search Two scopes of structure searches are possible

56 Structure Search Strategy
=> S L1 EXACT SAM SAMPLE SEARCH INITIATED 13:45:28 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA EXA SAM L1 This search is projected to run to completion online within system limits. Structure matches ("answers") are placed in an answer set that is given the next available L-number.

57 Structure Search Strategy
Step 5: Evaluate structure matches The no-cost D SCAN feature is used to evaluate structure search results. D SCAN randomly selects an answer from the answer set and displays it:

58 Structure Search Strategy
=> D SCAN L2 2 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN Benzeneacetonitrile, a-[3-[[2-(3,4- dimethoxyphenyl)ethyl]methylamino]propyl]-3,4- dimethoxy-a-(1-methylethyl)-, (aS)- (9CI) MF C27 H38 N2 O4 CI COM Absolute stereochemistry. HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1 Structure matches are in line with the desired results. Specify here the number of additional answers you want to see.

59 Structure Search Strategy
L2 2 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN Benzeneacetonitrile, α-[3-[[2-(3,4-dimethoxyphenyl) ethyl]methylamino]propyl]-3,4-dimethoxy-α-(1- methylethyl)-, labeled with deuterium (9CI) MF C27 H34 D4 N2 O4 ALL ANSWERS HAVE BEEN SCANNED

60 Structure Search Strategy
Step 6: Run a full-file structure search A full-file structure search searches the entire database. The SEARCH command is also used to run a full-file structure search. At the command line, 3 additional pieces of information are required: L-number assigned to the structure during uploading Type of structure search Scope of the structure search - e.g., full

61 Structure Search Strategy
=> S L1 EXACT FULL FULL SEARCH INITIATED 13:50:30 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA EXA FUL L1 The results of a full-file structure search are placed in a new answer set.

62 Structure Search Strategy
Option: Evaluate answers => D SCAN L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN Benzeneacetonitrile, a-[3-[[2-(3,4- dimethoxyphenyl)ethyl]methylamino]propyl]-3,4- dimethoxy-a-(1-methylethyl)-, labeled with deuterium (9CI) MF C27 H38 N2 O4 CI COM HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

63 Structure Search Strategy

64 Structure Search Strategy
Recall the Search Question: Locate references discussing the preparation of the following substance:

65 Structure Search Strategy
=> FILE CAPLUS => S L3/PREP 7690 L3 PREP/RL L L3/PREP (L3 (L) PREP/RL) Recall: L3 is the answer set resulting from the full-file structure search. Syntax: Type a slash and the CAS Role after the L-number The no-cost D SCAN feature is also available in CAplus. When the answer set contains more than one answer, D SCAN randomly selects an answer from the set and displays it. Evaluating results

66 Structure Search Strategy
Evaluating results => D SCAN L ANSWERS CAPLUS COPYRIGHT 2001 ACS    TI Investigation of radiochemical purity of 99mTc verapamil ST verapamil technetium 99m labeling radiochem purity IT DP, Verapamil, 99mTc-labeled DP, Technetium 99, verapamil labeled with, preparation RL: PNU (Preparation, unclassified); PRP (Properties); PREP (Preparation) (preparation and radiochem. purity of 99mTc verapamil) HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0 The CAS RN from the REGISTRY answer set that caused the answer to be retrieved is highlighted.

67 Displaying CAplus answers in more detail
Structure Search Strategy Displaying CAplus answers in more detail The DISPLAY command is used to show additional information for the answers retrieved from CAplus. Several predefined display formats are available:

68 Structure Search Strategy
Summary

69 Structure Search Strategy
Summary

70 Skills Practice Locate studies on the following substance, as well as stereoisomers and radiolabeled forms. Display the bibliographic information for the first 3 answers in the CAplus answer set. Locate references reporting the preparation of the following specific substance or any of its salts. Display the BIB, ABS, and HITSTR information for the first 3 answers in the CAplus answer set.

71 Substructure Searching

72 Skills Practice Using the structure in the Search Question, run the following searches: Exact full Family full Substructure full

73 Skills Practice Use D SCAN to look at the answers retrieved in each search. Note the different kinds of substances retrieved by each type of search.

74 Substructure Searching
In this section, you will learn how to draw structures for substructure searches to retrieve matches with the desired Ring system fusion Bonding patterns Substitution at open positions

75 Substructure Searching
Default Assumptions STN Express assigns attribute information to the rings and chains in a substructure query. These attributes determine the types of answers retrieved by a substructure search. Default assumptions can be modified.

76 Substructure Searching
Ring Systems The default assumption for ring systems is called "isolated/embedded". STN automatically searches for each ring system in a structure As drawn, without other rings attached to it via bonds (isolated) As part of a larger ring system, with other rings fused to it (embedded) An option exists to specify that a ring system in a structure be isolated.

77 Substructure Searching
Search Question: Locate substances with the following characteristics: Requirements: R and R' = anything, including hydrogen No substitution at the carbon in the ring marked with an asterisk (*) The ring may be further substituted at the other open sites, but there may be no additional rings fused at these positions

78 Building the Structure
Substructure Searching Building the Structure Don’t draw any atom Draw a Hydrogen Look at the following procedure

79 Substructure Searching
Ring isolation To prevent the substructure search from retrieving the ring system as part of a larger ring system, isolate the ring by doing the following:

80 Substructure Searching
Ring isolation

81 Substructure Searching
Ring isolation

82 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-ring.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR

83 Substructure Searching
=> S L1 SSS SAM (or only: => S L1) SAMPLE SEARCH INITIATED 14:49:50 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

84 Substructure Searching
=> D SCAN L2 1 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 3-Pyridinecarboxylic acid, 1-ethoxy-1,4-dihydro (phenylimino)-, ethyl ester (9CI) MF C16 H18 N2 O3 CI COM ALL ANSWERS HAVE BEEN SCANNED

85 Substructure Searching
=> S L1 SSS FULL (or only: => S L1 FULL) FULL SEARCH INITIATED 14:50:08 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1 => D SCAN L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 3-Pyridinecarboxylic acid, 5-cyano-1,4-dihydro imino-6-methyl-1-(phenylmethyl)-, ethyl ester (9CI) MF C17 H17 N3 O2 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

86 Skills Practice If this structure was drawn and Ring Isolation set to Isolated would the following substances be retrieved in a substructure search?

87 Skills Practice If this structure was drawn and Ring Isolation set to Isolated would the following substances be retrieved in a substructure search?

88 Skills Practice Run the search for this fragment twice:
One with the ring isolated One with the ring isolated embedded Look at the differences

89 Substructure Searching
Atoms Atoms in a substructure query may be Ring Chain Ring/Chain

90 Substructure Searching
Atoms The default assumption for atom retrieval depends on the location of an atom in the substructure query.

91 Substructure Searching
Search Question: Locate substances with the following characteristics: R, R', and R" = anything, including H The ring system may be substituted at all of the open sites There may be additional rings fused to the ring system R' and R" may form a ring with the N, e.g.,

92 Building the Structure
Substructure Searching Building the Structure Don’t draw any atom Rings isolated/embedded Look at the following procedure

93 Substructure Searching
Ring/chain atoms To allow the substructure search to retrieve a chain atom as part of a ring or a chain, do the following:

94 Substructure Searching
Ring/chain atoms

95 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-atom-rc.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR

96 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 15:06:29 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

97 Substructure Searching
=> D SCAN L2 4 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 3-Quinolinecarboxylic acid, 4-(4-hydroxy methoxyphenyl)-6,7-dimethoxy-2-(1H-1,2,4-triazol ylmethyl)-, ethyl ester (9CI) MF C24 H24 N4 O6 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1 Note: The N atom is part of a ring in this substructure match.

98 Substructure Searching
=> D SCAN L2 4 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 3-Quinolinecarboxylic acid, 2-[(dibutylamino) methyl]-4-(3,4-dimethoxyphenyl)-6,7-dimethoxy-, ethyl ester (9CI) MF C31 H42 N2 O6 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0 Note: The N atom is part of a chain in this substructure match.

99 Substructure Searching
=> S L1 SSS FULL FULL SEARCH INITIATED 15:06:52 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1 => D SCAN L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 3-Quinolinecarboxylic acid, 4-(3,4-dimethoxy phenyl)-6,7-dimethoxy-2-(4-thiomorpholinylmethyl)-, ethyl ester (9CI) MF C27 H32 N2 O6 S HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

100 Substructure Searching

101 Substructure Searching
Bonding Patterns Each bond in a substructure query has three characteristics associated with it: Bond type - Whether the bond is part of a ring or a chain Bond structure - Whether the bond is single, double, triple, or unspecified Bond value - Whether the bond is exact or part of an alternating single/double bond system (e.g., tautomers)

102 Substructure Searching
Bond Type Bonds in a substructure query may be Ring Chain Ring/Chain

103 Substructure Searching
Bond Type

104 Substructure Searching
Bond Type An option exists to specify that a bond type be ring/chain.

105 Substructure Searching
Search Question: Expand the previous query to retrieve substructure matches where the O atoms and the CH2—N chain are also parts of rings fused onto the quinoline ring. Requirements: There may be any type of substitution at all open sites There may be additional rings fused to the ring system

106 Building the Structure
Substructure Searching Building the Structure Don’t draw any atom Ring isolated/embedded Look at the following procedure

107 Substructure Searching
Bond type To allow the substructure search to retrieve a chain bond as part of a ring or a chain, do the following:

108 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-bond-rc.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR

109 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 17:20:56 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 47.0% PROCESSED ITERATIONS ANSWERS INCOMPLETE SEARCH (SYSTEM LIMIT EXCEEDED) SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

110 Substructure Searching
=> S L1 SSS FULL FULL SEARCH INITIATED 17:23:15 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1 => D SCAN L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 1,4-Dioxino[2,3-g]quinoline-8-carboxylic acid, (3,4-dimethoxyphenyl)-2,3-dihydro-7-( piperidinylmethyl)-, ethyl ester (9CI) MF C28 H32 N2 O6 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

111 Skills Practice Have you found this answer?
Locate references discussing compounds containing the following substructure Requirements: Any of the atoms may be part of a ring There may be any type of substitution at any open site Have you found this answer?

112 Substructure Searching
Bond Structure Bonds in a substructure query may be drawn as Single Double Triple Unspecified

113 Substructure Searching
Bond Structure

114 Substructure Searching
Search Question: Locate substances with the following characteristics: Requirements: R is a carbon atom in a chain The bond may be single or double No additional ring fusion is desired on either ring system Any substitution may be present at all open sites

115 Building the Structure
Substructure Searching Building the Structure Draw a C atom Look at the following procedure Rings are isolated Don’t draw any atom

116 Substructure Searching
Bond structure To allow the substructure search to retrieve a bond that is single or double, do the following:

117 Substructure Searching
Bond structure

118 Substructure Searching
Bond structure

119 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-unspec-.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR Both ring systems were isolated.

120 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 12:04:39 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

121 Substructure Searching
=> D SCAN L2 9 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN Isoquinoline, 1,2-dibenzoyl-6,7-diethoxy-1,2,3, tetrahydro- (7CI) MF C27 H27 N O4 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1 Here, the bond to the O is a double bond. Note the double bonds in the fused ring!

122 Substructure Searching
L2 9 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 1-Isoquinolinemethanol, 1,2,3,4-tetrahydro-6, dimethoxy-2-methyl-a-phenyl-, [S-(R*,S*)]- (9CI) MF C19 H23 N O3 Absolute stereochemistry. HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0 Here, the bond to the O is a single bond.

123 Substructure Searching
=> S L1 SSS FULL FULL SEARCH INITIATED 12:05:40 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1

124 Substructure Searching
Bond Value Bonds in a substructure query may be Exact Normalized Exact/Normalized

125 Substructure Searching
Bond Value STN Express automatically assigns bond values based on characteristics of a substructure. It often assigns a bond to be exact or normalized to maximize the answers retrieved by a query. Generally, the bond values assigned by the software do not lead to undesirable answers.

126 Substructure Searching
Bond Value Run three exact full searches for the previous structure. With all exact bonds With all normalized bonds With all exact-normalized bonds Look at the differences

127 Substructure Searching
Bond Value With all exact bonds => l1 exa full FULL SEARCH INITIATED 09:27:14 FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED 4911 ITERATIONS 305 ANSWERS SEARCH TIME: L SEA EXA FUL L1 => l2 not ids/ci IDS/CI L L2 NOT IDS/CI

128 Substructure Searching
Bond Value With all exact bonds => d scan L ANSWERS REGISTRY COPYRIGHT 2003 ACS IN Cyclohexane-d7 (6CI, 7CI, 9CI) MF C6 H5 D7

129 Substructure Searching
Bond Value With all normalized bonds => l4 exa full FULL SEARCH INITIATED 09:29:16 FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA EXA FUL L4 => l5 not ids/ci IDS/CI L L5 NOT IDS/CI

130 Substructure Searching
Bond Value With all normalized bonds => d scan L ANSWERS REGISTRY COPYRIGHT 2003 ACS IN Benzene, tetracosamer, radical ion(1+) (9CI) MF (C6 H6)24 CI PMS, RIS CM 1

131 Substructure Searching
Bond Value With all exact-normalized bonds => l7 exa full FULL SEARCH INITIATED 09:30:29 FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA EXA FUL L7 => l2 or l5 L L2 OR L5 => l8 not l9 L L8 NOT L9

132 Substructure Searching
Bond Value With all exact-normalized bonds => d scan L ANSWERS REGISTRY COPYRIGHT 2003 ACS IN Cyclohexenylium-1-d (9CI) MF C6 H8 D

133 Substructure Searching

134 Substructure Searching
Bond value To change the bond values assigned by STN Express, do the following:

135 Normalized Bonds in REGISTRY
Substructure Searching Normalized Bonds in REGISTRY In rings, normalized bonds are used when Alternating single and double bonds can be drawn all the way around a ring There are an even number of atoms in the ring In chains (or part of rings), normalized bonds are used only for certain tautomers Structure drawing preview: STN Express determines normalized bonds as you draw.

136 Substructure Searching
Bond Value

137 Normalized bonds are used for certain tautomeric chains:
Substructure Searching Normalized bonds are used for certain tautomeric chains: 1 3 2 H Central atom 2 = C, N, P, As, Sb, S, Se, Te, Cl, Br, I Hetero atoms 1 and 3 = N, O, S, Se, Te One hetero atom must have a hydrogen (or D, T, or a charge)

138 Substructure Searching
Bond Value

139 Substructure Searching
Bond Value

140 Substructure Searching
Bond Value

141 Substructure Searching
Normalized bonds Not normalized bonds Examples Examples acids amines amides Keto-enol tautomers do not fit the REGISTRY tautomer normalized bond rule, so you have to build a query to find both variations.

142 Substructure Searching
Normalized bonds Not normalized bonds benzenes thiophenes thiazoles anthracenes cyclooctaquinolizinium Normalized is not synonymous with aromatic.

143 Substructure Searching
Please run a Full Exact search for the following structures and look at the differences. Exact bonds Normalized bonds Exact/Normalized bonds Exact bonds

144 Substructure Searching
If you draw with STN Express the following structure how the software considers the value bonds? Why?

145 Substructure Searching
If you run an Exact full search of this structure with all bonds Normalized you get 1 answer If you run an Exact full search of this structure with all bonds Exact/Normalized do you get the previous answer?

146 Do these rings have normalized bonds?
Substructure Searching Do these rings have normalized bonds?

147 Do these rings have normalized bonds?
Substructure Searching Do these rings have normalized bonds? YES YES 1 – 14 C’s in ring, alternating single and double 2 – 26 atoms in ring system, all alternating single and double bonds 3- 13 atoms in ring, single bonds in center ring NO single exact bonds are denoted

148 Substructure Searching
Search Question: STN Express assigned "exact/normalized" bonds to the highlighted bonds in the following structure: Eliminate the answers where the C6 rings contain alternating single and double bonds.

149 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-bond-value.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR

150 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 14:02:02 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

151 Substructure Searching
=> D SCAN L2 3 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 1-Naphthalenemethanol, decahydro-a-[6-methoxy-2,3- bis(phenylmethoxy)phenyl]-2,5,5,8a-tetramethyl-, (1S,2S,4aS,8aS)- (9CI) MF C36 H46 O4 Absolute stereochemistry. HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

152 Skills Practice Run two searches for the following structures. No ring fusions, other double bonds, or substitutions, are allowed. No salts or mixture desired. Which kind of search?

153 => l1 exa full L

154 => l3 exa full L => l4 not l2 L

155 Skills Practice => l6 exa full L7 157
Can you run only one search for retrieving both the structures? Which searching structure? Which kind of search? => l6 exa full L

156 => l7 not l2 L L7 NOT L2 => l8 not l4 L L8 NOT L4

157 Skills Practice Run two different searches for the reported structures (Full SSS, STN Express default). - Are the results different? If yes,why?

158 => l1 full L

159 => l3 full L

160 => l2 and l4 L

161 => l2 not l4 and nc=1 L

162 => l4 not (l2 or m/els) and nc=1

163 Skills Practice Which structure search should be run in order to collect all the previous results?

164 => l7 full L

165 => l8 not (l2 or l4) L

166 Substructure Searching
BONDS Be carefull with cycles with odd number of atoms. Expecially if they contain hetero atoms. The bonds are not normalized usually (unless…). The position of double bonds depends on nomenclature. Which the best way for searching them?

167 Skills Practice Run two different searches for the reported structures (Full SSS, STN Express default). - Are the results different? If yes,why?

168 => l1 full L

169 => l3 full L

170 => l2 not l4 L L2 NOT L4

171 => l4 not l2 L L4 NOT L2

172 Skills Practice Which structure search should be run in order to collect all the previous results?

173 => l7 full L

174 => l8 not l2 L L8 NOT L2 => l9 not l4 L L9 NOT L4

175

176

177 Skills Practice Which is the less expensive search for retrieving both of the previous compounds?

178 Substructure Searching
BONDS BE CAREFUL WITH BONDS!

179 Substructure Searching
Extend

180 Substructure Searching
=> fil reg L STRUCTURE UPLOADED => d L1 HAS NO ANSWERS L STR 

181 Substructure Searching
=> l1 FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1 => l1 full FULL SEARCH INITIATED 06:07:43 FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1

182 Substructure Searching
=> set extend on perm SET COMMAND COMPLETED => l1 full FULL SEARCH INITIATED 06:08:38 L SEA SSS FUL L1 EXTEND CANDIDATE STRUCTURE SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1

183 Substructure Searching
=> d scan l4

184 Substructure Searching
Atom Substitution There are a number of ways to control atom substitution in substructure search matches:

185 Substructure Searching
Variable Atoms A number of system-defined options are available to allow variability in atom substitution:

186 Substructure Searching
Variable Atoms

187 Substructure Searching
Search Question: Locate substances with the following characteristics: Requirements: R is any type of heterocyclic ring, with any type of substitution R' is any non-hydrogen ring or chain substituent No additional ring fusion is desired on either ring system Any substitution may be present at all open sites

188 Building the Structure
Substructure Searching Building the Structure Look at the following procedure Draw a A variable (R/C) Rings are isolated Don’t draw any atom

189 Substructure Searching
Variable group To add a variable group to an existing structure, do the following:

190 Substructure Searching
Variable group

191 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-variable.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR The rings were isolated, and the node characteristic of A was set to ring/chain.

192 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 16:29:55 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 38.8% PROCESSED ITERATIONS ANSWERS INCOMPLETE SEARCH (SYSTEM LIMIT EXCEEDED) SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

193 Substructure Searching
=> D SCAN L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 2,5-Pyrrolidinedione, 1-[3-[4-(4-methoxyphenyl)-1- piperazinyl]propyl]-(9CI) MF C18 H25 N3 O3 CI COM HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0 A Hy

194 Substructure Searching
=> S L1 SSS FULL FULL SEARCH INITIATED 16:30:53 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1

195 Substructure Searching
G-Groups In addition to the STN system-defined variable groups (e.g., X), variable groups containing substituents of your own definition may be created. User-defined variable groups are called G-groups on STN. A G-group can have up to 20 substituents comprising it. Those substituents may be: Specific elements Shortcuts System-defined variable groups Structural fragments Other G-groups

196 Substructure Searching
Search Question: Locate substances with the following characteristics: Requirements: R = C, N R' = Me, Et, n-Pr, i-Pr, OH, SH, X No additional ring fusion is allowed on either of the 6-membered rings Additional substitution may be present at all open sites

197 Building the Structure
Substructure Searching Building the Structure Look at the following procedure Rings are isolated Don’t draw any atom

198 Substructure Searching
G-group To define and add a G-group to an existing structure, do the following:

199 Substructure Searching
G-group

200 Substructure Searching
G-group

201 Substructure Searching
G-group

202 Substructure Searching
G-group

203 Substructure Searching
G-group

204 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-ggroup.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR

205 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 11:59:54 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

206 Substructure Searching
=> D SCAN L2 2 ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 4-Piperidinol, 1-[3-(6-fluoro-1,2-benzisoxazol yl)propyl]-4-(4-fluorophenyl)- (9CI) MF C21 H22 F2 N2 O2 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0 G1 = F G2 = C, C

207 Variable Points of Attachment on Rings
Substructure Searching Variable Points of Attachment on Rings A substituent can be tagged to appear at variable positions on a ring. The substituent may be a Specific element Shortcut System-defined variable group Structural fragment G-group

208 Substructure Searching
Search Question: Locate substances with the following characteristics: Requirements: R is any type of heterocyclic ring with any type of substitution No additional ring fusion is desired on the phenyl or N-containing rings Any substitution may be present at all open sites There must be at least one halogen atom attached to the phenyl ring, at the positions marked with asterisks

209 Building the Structure
Substructure Searching Building the Structure Draw a Hy variable Rings are isolated Don’t draw any atom Look at the following procedure

210 Variable point of attachment
Substructure Searching Variable point of attachment To add a substituent with a variable point of attachment to an existing structure, do the following:

211 Variable point of attachment
Substructure Searching Variable point of attachment

212 Variable point of attachment
Substructure Searching Variable point of attachment

213 Substructure Searching
=> FILE REGISTRY => Uploading C:\Program Files\Stnexp\Queries\sss-vpa.str L STRUCTURE UPLOADED => D L1 L1 HAS NO ANSWERS L STR

214 Substructure Searching
=> S L1 SSS SAM SAMPLE SEARCH INITIATED 14:20:48 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED TO ITERATE 38.8% PROCESSED ITERATIONS ANSWERS INCOMPLETE SEARCH (SYSTEM LIMIT EXCEEDED) SEARCH TIME: FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE** PROJECTED ITERATIONS: TO PROJECTED ANSWERS: TO L SEA SSS SAM L1

215 Substructure Searching
=> D SCAN L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN Pyrrolo[3,2-c]azepin-4(1H)-one, 5-[3-[4-( fluorophenyl)-1-piperazinyl]propyl]-5,6,7, tetrahydro-8-hydroxy-1-methyl- (9CI) MF C22 H29 F N4 O2 HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1 F at the para position

216 Substructure Searching
L ANSWERS REGISTRY COPYRIGHT 2001 ACS IN 1H-Indole-2,3-dione, 1-[3-[4-(3-chlorophenyl) piperazinyl]propyl]-, dihydrochloride (9CI) MF C21 H22 Cl N3 O2 . 2 Cl H HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0 => S L1 SSS FULL FULL SEARCH INITIATED 14:21:19 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED TO ITERATE 100.0% PROCESSED ITERATIONS ANSWERS SEARCH TIME: L SEA SSS FUL L1 Cl at the meta position

217 Substructure Searching
Summary

218 Substructure Searching
Summary

219 Skills Practice Are there any patents dealing with the preparation of compounds with the following structures? If so, who holds those patents? Requirements: R = Any carbocyclic ring system. All the carbocyclic rings may be identical or they may be different and they may be substituted or unsubstituted.

220 Skills Practice Locate papers discussing substances with the following structure. What have these substances been used for? Requirements: R = Ring or chain carbon with any type of substitution

221 Skills Practice What companies recently have been working on compounds with the following structure? HINT: Look at the CS (corporate source) index in the CAplus file record. Requirements: R = Hydrogen or any non-hydrogen substituent R' = Any non-hydrogen, non-carbon substituent, e.g., a keto functionality All other open sites may have any type of substitution Rings may have other rings fused to them

222 Skills Practice Locate papers discussing the synthesis of the following substances: Requirements: R = X, H, -NO2, -CF3 R' = Anything but hydrogen R" = C or N Any substitution may be present at all other open sites Phenyl ring shown in the structure may not have additional rings fused to it

223 Skills Practice Locate references discussing compounds containing the following structure fragment: Requirements: At least 2 -OH groups must be attached to the rings labeled "A" and "B” Any substitution at the remaining open sites Ring system may have other rings fused to it


Download ppt "Structure Searching with STN Express"

Similar presentations


Ads by Google