Once the glow of that milestone passes, the daunting task of identifying modulators, preferably low molecular weight substances, to interact with the proteins of these assays, is the inexorable activity of the company drug discovery engine.
Diversity clustering
Another efficient drug discovery screening strategy is to sample large library pools through the selection of molecules that are representative of a group (cluster) within this library. In this component of ChemDB, Jarvis-Patrick clustering based on fingerprints (bit strings) is utilized. By identifying the agent with the highest degree of similarity (centroid) to members of these clusters, as well as those agents that are chemically unique (singletons), a sampling of a large chemical pool is possible without the cost of purchasing the majority of these molecules. This approach also has the advantage of having cluster-members immediately available, if a centroid shows activity of interest. However, there is considerable flexibility in creating the cluster-singleton distribution with this method. The selected diversity parameters—threshold of acceptable dissimilarity of a cluster, and number of nearest neighbors in common—affect amount and size of clusters and the accompanying number of singletons. With some iteration the cluster statistics can be manipulated to provide a distribution that will fulfill sampling goals and provide reasonable homogeneity in cluster groups and a manageable number of singletons. The ChemDB software package includes this sampling procedure and TimTec's access to databases of hundreds of thousand of compounds provides libraries that approach the gamut of commercially available chemical diversity. This clustering approach, coupled with diversity strategy, can be a powerful means of library development for even large, mature collections.
Conclusions
Drug discovery companies implement a variety of approaches to their hit/lead-identification challenge. Through utilization of adaptable and effective software like ChemDB from TimTec Inc., rapid determination of compound parameters and expedient deployment of strategically useful programs can improve upon the efficiency of this process. Coupled with access to hundreds of thousands of molecules in various commercial databases, diversity libraries can be developed to meet the challenges of lead identification in the most demanding of situations. Library clustering also adds an efficient and practical dimension to this strategy that can increase cost effectiveness in both library establishment and archive collection augmentation.
Chemical management software promises diversity optimization
—Robert J. Chorvat, PhD
Director of Medicinal Chemistry Business Development, TimTec Inc., Newark, Del.
Drug Discovery, Reed Business Information, Morris Plains, NJ
Company | Quick Links | Resources | Customer Support | TimTec Network |
About Us Our Customers Our Partners Register & Login Contact Site Search |
Compound Libraries Natural products Buy Compounds Online Bioscreening Directory |
Glossary Structure Search Database Downloads FAQs |
Customer Service Terms of Sale Your Feedback (Un)Subscribe |
eChemStore eChemShop ActiMol.com MyriaScreen.com ChemDBsoft.com NaturalCompounds.org |