<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="/rss.css" type="text/css"?>
<rdf:RDF xmlns="http://purl.org/rss/1.0/"
    xmlns:cc="http://web.resource.org/cc/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:extra="http://www.w3.org/1999/xhtml"
    xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
    <channel rdf:about="http://www.jcheminf.com/feeds/mostaccessed/journal?quantity=&amp;format=rss&amp;version=">
        <title>Journal of Cheminformatics - Most accessed articles</title>
        <link>http://www.jcheminf.com</link>
        <description>The most accessed research articles published by Journal of Cheminformatics</description>
        <dc:date>2012-05-15T00:00:00Z</dc:date>
        <items>
            <rdf:Seq>
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/4/1/8" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/4/1/1" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/1/1/8" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/4/1/7" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/4/1/10" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/3/1/33" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/4/1/5" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/1/1/21" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/4/1/3" />
                                <rdf:li rdf:resource="http://www.jcheminf.com/content/1/1/3" />
                            </rdf:Seq>
        </items>
                 <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </channel>
        <item rdf:about="http://www.jcheminf.com/content/4/1/8">
        <title>Structure-based classification and ontology in chemistry</title>
        <description>Background:
Recent years have seen an explosion in the availability of data in the chemistry domain. With this information explosion, however, retrieving relevant results from the available information, and organising those results, become even harder problems. Computational processing is essential to filter and organise the available resources so as to better facilitate the work of scientists. Ontologies encode expert domain knowledge in a hierarchically organised machine-processable format. One such ontology for the chemical domain is ChEBI. ChEBI provides a classification of chemicals based on their structural features and a role or activity-based classification. An example of a structure-based class is &apos;pentacyclic compound&apos; (compounds containing five-ring structures), while an example of a role-based class is &apos;analgesic&apos;, since many different chemicals can act as analgesics without sharing structural features. Structure-based classification in chemistry exploits elegant regularities and symmetries in the underlying chemical domain. As yet, there has been neither a systematic analysis of the types of structural classification in use in chemistry nor a comparison to the capabilities of available technologies.
Results:
We analyze the different categories of structural classes in chemistry, presenting a list of patterns for features found in class definitions. We compare these patterns of class definition to tools which allow for automation of hierarchy construction within cheminformatics and within logic-based ontology technology, going into detail in the latter case with respect to the expressive capabilities of the Web Ontology Language and recent extensions for modelling structured objects. Finally we discuss the relationships and interactions between cheminformatics approaches and logic-based approaches.
Conclusion:
Systems that perform intelligent reasoning tasks on chemistry data require a diverse set of underlying computational utilities including algorithmic, statistical and logic-based tools. For the task of automatic structure-based classification of chemical entities, essential to managing the vast swathes of chemical data being brought online, systems which are capable of hybrid reasoning combining several different approaches are crucial. We provide a thorough review of the available tools and methodologies, and identify areas of open research.</description>
        <link>http://www.jcheminf.com/content/4/1/8</link>
                <dc:creator>Janna Hastings</dc:creator>
                <dc:creator>Despoina Magka</dc:creator>
                <dc:creator>Colin Batchelor</dc:creator>
                <dc:creator>Lian Duan</dc:creator>
                <dc:creator>Robert Stevens</dc:creator>
                <dc:creator>Marcus Ennis</dc:creator>
                <dc:creator>Christoph Steinbeck</dc:creator>
                <dc:source>Journal of Cheminformatics 2012, null:8</dc:source>
        <dc:date>2012-04-05T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-4-8</dc:identifier>
                                <prism:require>/content/figures/1758-2946-4-8-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>8</prism:startingPage>
        <prism:publicationDate>2012-04-05T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>PDF</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/4/1/1">
        <title>Making SharePoint(R) Chemically Aware(TM)</title>
        <description>Background:
The use of SharePoint&#174; collaboration software for content management has become a critical part of today&apos;s drug discovery process. SharePoint 2010 software has laid a foundation which enables researchers to collaborate and search on various contents. The amount of data generated during a transition of a single compound from preclinical discovery to commercialization can easily range in terabytes, thus there is a greater demand of a chemically aware search algorithm that supplements SharePoint which enables researchers to query for information in a more intuitive and effective way. Thus by supplementing SharePoint with Chemically Aware&#8482; features provides a great value to the pharmaceutical and biotech companies and makes drug discovery more efficient. Using several tools we have integrated SharePoint with chemical, compound, and reaction databases, thereby improving the traditional search engine capability and enhancing the user experience.
Results:
This paper describes the implementation of a Chemically Aware&#8482; system to supplement SharePoint. A Chemically Aware SharePoint (CASP) allows users to tag documents by drawing a structure and associating it with the related content. It also allows the user to search SharePoint software content and internal/external databases by carrying out substructure, similarity, SMILES, and IUPAC name searches. Building on traditional search, CASP takes SharePoint one step further by providing a intuitive GUI to the researchers to base their search on their knowledge of chemistry than textual search. CASP also provides a way to integrate with other systems, for example a researcher can perform a sub-structure search on pdf documents with embedded molecular entities.
Conclusion:
A Chemically Aware&#8482; system supplementing SharePoint is a step towards making drug discovery process more efficient and also helps researchers to search for information in a more intuitive way. It also helps the researchers to find information which was once difficult to find by allowing one to tag documents with molecular entities and integrating with image recognition software to find information from pdf documents.</description>
        <link>http://www.jcheminf.com/content/4/1/1</link>
                <dc:creator>Kartik Tallapragada</dc:creator>
                <dc:creator>Joseph Chewning</dc:creator>
                <dc:creator>David Kombo</dc:creator>
                <dc:creator>Beverly Ludwick</dc:creator>
                <dc:source>Journal of Cheminformatics 2012, null:1</dc:source>
        <dc:date>2012-01-12T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-4-1</dc:identifier>
                                <prism:require>/content/figures/1758-2946-4-1-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>1</prism:startingPage>
        <prism:publicationDate>2012-01-12T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/1/1/8">
        <title>Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions</title>
        <description>Background:
A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make) and 10 (very difficult to make) is described in this article.
Results:
The method for estimation of the synthetic accessibility score (SAscore) described here is based on a combination of fragment contributions and a complexity penalty. Fragment contributions have been calculated based on the analysis of one million representative molecules from PubChem and therefore one can say that they capture historical synthetic knowledge stored in this database. The molecular complexity score takes into account the presence of non-standard structural features, such as large rings, non-standard ring fusions, stereocomplexity and molecule size. The method has been validated by comparing calculated SAscores with ease of synthesis as estimated by experienced medicinal chemists for a set of 40 molecules. The agreement between calculated and manually estimated synthetic accessibility is very good with r2 = 0.89.
Conclusion:
A novel method to estimate synthetic accessibility of molecules has been developed. This method uses historical synthetic knowledge obtained by analyzing information from millions of already synthesized chemicals and considers also molecule complexity. The method is sufficiently fast and provides results consistent with estimation of ease of synthesis by experienced medicinal chemists. The calculated SAscore may be used to support various drug discovery processes where a large number of molecules needs to be ranked based on their synthetic accessibility, for example when purchasing samples for screening, selecting hits from high-throughput screening for follow-up, or ranking molecules generated by various de novo design approaches.</description>
        <link>http://www.jcheminf.com/content/1/1/8</link>
                <dc:creator>Peter Ertl</dc:creator>
                <dc:creator>Ansgar Schuffenhauer</dc:creator>
                <dc:source>Journal of Cheminformatics 2009, null:8</dc:source>
        <dc:date>2009-06-10T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-1-8</dc:identifier>
                                <prism:require>/content/figures/1758-2946-1-8-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>8</prism:startingPage>
        <prism:publicationDate>2009-06-10T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/4/1/7">
        <title>CheS-Mapper - Chemical Space Mapping and Visualization in 3D</title>
        <description>Analyzing chemical datasets is a challenging task for scientific researchers in the field of chemoinformatics. It is important, yet difficult to understand the relationship between the structure of chemical compounds, their physico-chemical properties, and biological or toxic effects. To that respect, visualization tools can help to better comprehend the underlying correlations. Our recently developed 3D molecular viewer CheS-Mapper (Chemical Space Mapper) divides large datasets into clusters of similar compounds and consequently arranges them in 3D space, such that their spatial proximity reflects their similarity. The user can indirectly determine similarity, by selecting which features to employ in the process. The tool can use and calculate different kind of features, like structural fragments as well as quantitative chemical descriptors. These features can be highlighted within CheS-Mapper, which aids the chemist to better understand patterns and regularities and relate the observations to established scientific knowledge. As a final function, the tool can also be used to select and export specific subsets of a given dataset for further analysis.</description>
        <link>http://www.jcheminf.com/content/4/1/7</link>
                <dc:creator>Martin Gutlein</dc:creator>
                <dc:creator>Andreas Karwath</dc:creator>
                <dc:creator>Stefan Kramer</dc:creator>
                <dc:source>Journal of Cheminformatics 2012, null:7</dc:source>
        <dc:date>2012-03-17T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-4-7</dc:identifier>
                            <dc:title>Chemical Space Mapping and Visualization in 3D</dc:title>
                            <dc:description>Analyzing chemical datasets is an important task for understanding structure-property relationships. Visualization tools can help to better comprehend the correlations, and in this paper, an open-source tool for inspecting small molecule datasets is presented</dc:description>
                <prism:require>/content/figures/1758-2946-4-7-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>7</prism:startingPage>
        <prism:publicationDate>2012-03-17T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/4/1/10">
        <title>In-silico Predictive Mutagenicity Model Generation Using Supervised Learning Approaches</title>
        <description>Background:
Experimental screening of chemical compounds for biological activity is a time consuming and expensive practice. In silico predictive models permit inexpensive, rapid &quot;virtual screening&quot; to prioritize selection of compounds for experimental testing. Both experimental and in silico screening can be used to test compounds for desirable or undesirable properties. Prior work on prediction of mutagenicity has primarily involved identification of toxicophores rather than whole-molecule predictive models. In this work, we examined a range of in silico predictive classification models for prediction of mutagenetic properties of compounds, including methods such as J48 and SMO which have not previously been widely applied in cheminformatics.
Results:
The Bursi mutagenicity data set containing 4337 compounds (Set 1) and a Benchmark data set of 6512 compounds (Set 2) were taken as input data seta in this work. A third data set (Set 3) was prepared by joining up the previous two sets. Classification algorithms including Naive Bayes, Random Forest, J48 and SMO with 10 fold cross-validation and default parameters were used for model generation on these data sets.  Models built using the combined performed better than those developed from the Benchmark data set. Significantly, Random Forest outperformed other classifiers for all the data sets, especially for Set 3 with 89.27% accuracy, 89% precision and ROC of 95.3%. To validate the developed models two external data sets, AID1189 and AID1194, with mutagenicity data were tested showing 62% accuracy with 67% precision and 65% ROC area and 91% accuracy, 91% precision with 96.3% ROC area respectively. A Random Forest model was used the approved	 drugs from DrugBank and metabolites from the Zinc Database with True Positives rate almost 85% showing the robustness of the model.
Conclusion:
We have created a new mutagenicity benchmark data set with around 8,000 compounds. Our work shows that highly accurate predictive mutagenicity models can be built using machine learning methods based on chemical descriptors and trained using this set, and these models provide a complement to toxicophores based methods. Further, our work supports other recent literature in showing that Random Forest models generally outperform other comparable machine learning methods for this kind of application.</description>
        <link>http://www.jcheminf.com/content/4/1/10</link>
                <dc:creator>Abhik Seal</dc:creator>
                <dc:creator>Anurag Passi</dc:creator>
                <dc:creator>UC Abdul Jaleel</dc:creator>
                <dc:creator>David Wild</dc:creator>
                <dc:creator>OSDD Consortium</dc:creator>
                <dc:source>Journal of Cheminformatics 2012, null:10</dc:source>
        <dc:date>2012-05-15T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-4-10</dc:identifier>
                                <prism:require>/content/figures/1758-2946-4-10-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>10</prism:startingPage>
        <prism:publicationDate>2012-05-15T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>PDF</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/3/1/33">
        <title>Open Babel: An open chemical toolbox</title>
        <description>Background:
A frequent problem in computational modeling is the interconversion of chemical structures between different formats. While standard interchange formats exist (for example, Chemical Markup Language) and de facto standards have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chemistry data, differences in the data stored by different formats (0D versus 3D, for example), and competition between software along with a lack of vendor-neutral formats.
Results:
We discuss, for the first time, Open Babel, an open-source chemical toolbox that speaks the many languages of chemical data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chemical and molecular data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion.
Conclusions:
Open Babel presents a solution to the proliferation of multiple chemical file formats. In addition, it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chemical data in areas such as organic chemistry, drug design, materials science, and computational chemistry. It is freely available under an open-source license from http://openbabel.org.</description>
        <link>http://www.jcheminf.com/content/3/1/33</link>
                <dc:creator>Noel O'Boyle</dc:creator>
                <dc:creator>Michael Banck</dc:creator>
                <dc:creator>Craig James</dc:creator>
                <dc:creator>Chris Morley</dc:creator>
                <dc:creator>Tim Vandermeersch</dc:creator>
                <dc:creator>Geoffrey Hutchison</dc:creator>
                <dc:source>Journal of Cheminformatics 2011, null:33</dc:source>
        <dc:date>2011-10-07T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-3-33</dc:identifier>
                            <dc:title>Open Babel: An open chemical toolbox</dc:title>
                            <dc:description>The first publication reporting the features, implementation and validation of the open source chemical toolbox - Open Babel - is described, which includes a summary of key advances in the 2.3 release</dc:description>
                <prism:require>/content/figures/1758-2946-3-33-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>33</prism:startingPage>
        <prism:publicationDate>2011-10-07T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/4/1/5">
        <title>Blind trials of Computer-Assisted Structure Elucidation software </title>
        <description>Background:
One of the largest challenges in chemistry today remains that of efficiently mining through vast amounts of data in order to elucidate the chemical structure for an unknown compound. The elucidated candidate compound must be fully consistent with the data and any other competing candidates efficiently eliminated without doubt by using additional data if necessary. It has become increasingly necessary to incorporate an in silico structure generation and verification tool to facilitate this elucidation process. An effective structure elucidation software technology aims to mimic the skills of a human in interpreting the complex nature of spectral data while producing a solution within a reasonable amount of time. This type of software is known as computer-assisted structure elucidation or CASE software. A systematic trial of the ACD/Structure Elucidator CASE software was conducted over an extended period of time by analysing a set of single and double-blind trials submitted by a global audience of scientists. The purpose of the blind trials was to reduce subjective bias. Double-blind trials comprised of data where the candidate compound was unknown to both the submitting scientist and the analyst. The level of expertise of the submitting scientist ranged from novice to expert structure elucidation specialists with experience in pharmaceutical, industrial, government and academic environments.
Results:
Beginning in 2003, and for the following nine years, the algorithms and software technology contained within ACD/Structure Elucidator have been tested against 112 data sets; many of these were unique challenges. Of these challenges 9% were double-blind trials. The results of eighteen of the single-blind trials were investigated in detail and included problems of a diverse nature with many of the specific challenges associated with algorithmic structure elucidation such as deficiency in protons, structure symmetry, a large number of heteroatoms and poor quality spectral data.
Conclusion:
When applied to a complex set of blind trials, ACD/Structure Elucidator was shown to be a very useful tool in advancing the computer&apos;s contribution to elucidating a candidate structure from a set of spectral data (NMR and MS) for an unknown. The synergistic interaction between humans and computers can be highly beneficial in terms of less biased approaches to elucidation as well as dramatic improvements in speed and throughput. In those cases where multiple candidate structures exist, ACD/Structure Elucidator is equipped to validate the correct structure and eliminate inconsistent candidates. Full elucidation can generally be performed in less than two hours; this includes the average spectral data processing time and data input.</description>
        <link>http://www.jcheminf.com/content/4/1/5</link>
                <dc:creator>Arvin Moser</dc:creator>
                <dc:creator>Mikhail Elyashberg</dc:creator>
                <dc:creator>Antony Williams</dc:creator>
                <dc:creator>Kirill Blinov</dc:creator>
                <dc:creator>Joseph DiMartino</dc:creator>
                <dc:source>Journal of Cheminformatics 2012, null:5</dc:source>
        <dc:date>2012-02-09T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-4-5</dc:identifier>
                                <prism:require>/content/figures/1758-2946-4-5-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>5</prism:startingPage>
        <prism:publicationDate>2012-02-09T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/1/1/21">
        <title>Virtual screening of bioassay data</title>
        <description>Background:
There are three main problems associated with the virtual screening of bioassay data. The first is access to freely-available curated data, the second is the number of false positives that occur in the physical primary screening process, and finally the data is highly-imbalanced with a low ratio of Active compounds to Inactive compounds. This paper first discusses these three problems and then a selection of Weka cost-sensitive classifiers (Naive Bayes, SVM, C4.5 and Random Forest) are applied to a variety of bioassay datasets.
Results:
Pharmaceutical bioassay data is not readily available to the academic community. The data held at PubChem is not curated and there is a lack of detailed cross-referencing between Primary and Confirmatory screening assays. With regard to the number of false positives that occur in the primary screening process, the analysis carried out has been shallow due to the lack of cross-referencing mentioned above. In six cases found, the average percentage of false positives from the High-Throughput Primary screen is quite high at 64%. For the cost-sensitive classification, Weka&apos;s implementations of the Support Vector Machine and C4.5 decision tree learner have performed relatively well. It was also found, that the setting of the Weka cost matrix is dependent on the base classifier used and not solely on the ratio of class imbalance.
Conclusions:
Understandably, pharmaceutical data is hard to obtain. However, it would be beneficial to both the pharmaceutical industry and to academics for curated primary screening and corresponding confirmatory data to be provided. Two benefits could be gained by employing virtual screening techniques to bioassay data. First, by reducing the search space of compounds to be screened and secondly, by analysing the false positives that occur in the primary screening process, the technology may be improved. The number of false positives arising from primary screening leads to the issue of whether this type of data should be used for virtual screening. Care when using Weka&apos;s cost-sensitive classifiers is needed - across the board misclassification costs based on class ratios should not be used when comparing differing classifiers for the same dataset.</description>
        <link>http://www.jcheminf.com/content/1/1/21</link>
                <dc:creator>Amanda Schierz</dc:creator>
                <dc:source>Journal of Cheminformatics 2009, null:21</dc:source>
        <dc:date>2009-12-22T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-1-21</dc:identifier>
                                <prism:require>/content/figures/1758-2946-1-21-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>21</prism:startingPage>
        <prism:publicationDate>2009-12-22T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/4/1/3">
        <title>LICSS - A chemical spreadsheet in Microsoft Excel</title>
        <description>Background:
Representations of chemical datasets in spreadsheet format are important for ready data assimilation and manipulation. In addition to the normal spreadsheet facilities, chemical spreadsheets need to have visualisable chemical structures and data searchable by chemical as well as textual queries. Many such chemical spreadsheet tools are available, some operating in the familiar Microsoft Excel environment. However, within this group, the performance of Excel is often compromised, particularly in terms of the number of compounds which can usefully be stored on a sheet.SummaryLICSS is a lightweight chemical spreadsheet within Microsoft Excel for Windows. LICSS stores structures solely as Smiles strings. Chemical operations are carried out by calling Java code modules which use the CDK, JChemPaint and OPSIN libraries to provide cheminformatics functionality. Compounds in sheets or charts may be visualised (individually or en masse), and sheets may be searched by substructure or similarity. All the molecular descriptors available in CDK may be calculated for compounds (in batch or on-the-fly), and various cheminformatic operations such as fingerprint calculation, Sammon mapping, clustering and R group table creation may be carried out.We detail here the features of LICSS and how they are implemented. We also explain the design criteria, particularly in terms of potential corporate use, which led to this particular implementation.
Conclusions:
LICSS is an Excel-based chemical spreadsheet with a difference:&#8226; It can usefully be used on sheets containing hundreds of thousands of compounds; it doesn&apos;t compromise the normal performance of Microsoft Excel&#8226; It is designed to be installed and run in environments in which users do not have admin privileges; installation involves merely file copying, and sharing of LICSS sheets invokes automatic installation&#8226; It is free and extensibleLICSS is open source software and we hope sufficient detail is provided here to enable developers to add their own features and share with the community.</description>
        <link>http://www.jcheminf.com/content/4/1/3</link>
                <dc:creator>Kevin Lawson</dc:creator>
                <dc:creator>Jonty Lawson</dc:creator>
                <dc:source>Journal of Cheminformatics 2012, null:3</dc:source>
        <dc:date>2012-02-02T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-4-3</dc:identifier>
                            <dc:title>A chemical spreadsheet in Excel</dc:title>
                            <dc:description>A lightweight open source chemical spreadsheet has been developed that runs within Microsoft Excel and can be used on sheets containing hundreds of thousands of compounds without compromising normal performance</dc:description>
                <prism:require>/content/figures/1758-2946-4-3-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>3</prism:startingPage>
        <prism:publicationDate>2012-02-02T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.jcheminf.com/content/1/1/3">
        <title>Computer-assisted methods for molecular structure elucidation: realizing a spectroscopist&apos;s dream</title>
        <description>Background:
This article coincides with the 40 year anniversary of the first published works devoted to the creation of algorithms for computer-aided structure elucidation (CASE). The general principles on which CASE methods are based will be reviewed and the present state of the art in this field will be described using, as an example, the expert system Structure Elucidator.
Results:
The developers of CASE systems have been forced to overcome many obstacles hindering the development of a software application capable of drastically reducing the time and effort required to determine the structures of newly isolated organic compounds. Large complex molecules of up to 100 or more skeletal atoms with topological peculiarity can be quickly identified using the expert system Structure Elucidator based on spectral data. Logical analysis of 2D NMR data frequently allows for the detection of the presence of COSY and HMBC correlations of &quot;nonstandard&quot; length. Fuzzy structure generation provides a possibility to obtain the correct solution even in those cases when an unknown number of nonstandard correlations of unknown length are present in the spectra. The relative stereochemistry of big rigid molecules containing many stereocenters can be determined using the StrucEluc system and NOESY/ROESY 2D NMR data for this purpose.
Conclusion:
The StrucEluc system continues to be developed in order to expand the general applicability, provide improved workflows, usability of the system and increased reliability of the results. It is expected that expert systems similar to that described in this paper will receive increasing acceptance in the next decade and will ultimately be integrated directly to analytical instruments for the purpose of organic analysis. Work in this direction is in progress. In spite of the fact that many difficulties have already been overcome to deliver on the spectroscopist&apos;s dream of &quot;fully automated structure elucidation&quot; there is still work to do. Nevertheless, as the efficiency of expert systems is enhanced the solution of increasingly complex structural problems will be achievable.</description>
        <link>http://www.jcheminf.com/content/1/1/3</link>
                <dc:creator>Mikhail Elyashberg</dc:creator>
                <dc:creator>Kirill Blinov</dc:creator>
                <dc:creator>Sergey Molodtsov</dc:creator>
                <dc:creator>Yegor Smurnyy</dc:creator>
                <dc:creator>Antony Williams</dc:creator>
                <dc:creator>Tatiana Churanova</dc:creator>
                <dc:source>Journal of Cheminformatics 2009, null:3</dc:source>
        <dc:date>2009-03-17T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1186/1758-2946-1-3</dc:identifier>
                                <prism:require>/content/figures/1758-2946-1-3-toc.gif</prism:require>
                <prism:publicationName>Journal of Cheminformatics</prism:publicationName>
        <prism:issn>1758-2946</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>3</prism:startingPage>
        <prism:publicationDate>2009-03-17T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <cc:License rdf:about="http://creativecommons.org/licenses/by/2.0/">
        <cc:permits rdf:resource="http://creativecommons.org/ns#Reproduction" />
        <cc:permits rdf:resource="http://creativecommons.org/ns#Distribution" />
        <cc:permits rdf:resource="http://creativecommons.org/ns#DerivativeWorks" />
    </cc:License>
</rdf:RDF>

