Computer-assisted chemical structure searching plays a critical role in efficient structure screening in cheminformatics. We designed a high-performance chemical structure and data search engine called DCAIKU, built on the CouchDB and ElasticSearch engines. DCAIKU converts the chemical structure similarity search problem into a general text search problem so that it can use off-the-shelf full-text search engines. DCAIKU also supports flexible document structures and heterogeneous datasets with the help of a schema-less document database. Our evaluations show that DCAIKU can handle both keyword search and structural search against millions of records with both high accuracy and low latency. We expect that DCAIKU will lay the foundation for large-scale and cost-effective structural search in materials science and chemistry research.
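The idea of recasting similarity search as text search can be illustrated with a minimal Python sketch. It is not DCAIKU's implementation: the abstract does not say how structures are encoded, so the sketch assumes a toy encoding (character 3-grams of a SMILES string) in place of a real chemical fingerprint, and a small in-memory inverted index in place of ElasticSearch. The names `tokens`, `tanimoto`, and `TextIndex` are hypothetical.

```python
from collections import defaultdict


def tokens(smiles: str, n: int = 3) -> set[str]:
    """Toy stand-in for a fingerprint: the set of character n-grams.

    Assumption for illustration only; a real system would derive tokens
    from a chemical fingerprint, not from raw SMILES characters.
    """
    if len(smiles) < n:
        return {smiles}
    return {smiles[i:i + n] for i in range(len(smiles) - n + 1)}


def tanimoto(a: set[str], b: set[str]) -> float:
    """Tanimoto (Jaccard) similarity between two token sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


class TextIndex:
    """Minimal inverted index: token -> ids of documents containing it."""

    def __init__(self) -> None:
        self.postings: dict[str, set[str]] = defaultdict(set)
        self.docs: dict[str, set[str]] = {}

    def add(self, doc_id: str, smiles: str) -> None:
        toks = tokens(smiles)
        self.docs[doc_id] = toks
        for tok in toks:
            self.postings[tok].add(doc_id)

    def search(self, smiles: str, top_k: int = 3) -> list[tuple[str, float]]:
        query = tokens(smiles)
        # Text-search step: collect documents sharing at least one token.
        candidates: set[str] = set()
        for tok in query:
            candidates |= self.postings.get(tok, set())
        # Ranking step: score candidates by Tanimoto similarity.
        scored = [(doc_id, tanimoto(query, self.docs[doc_id]))
                  for doc_id in candidates]
        scored.sort(key=lambda pair: (-pair[1], pair[0]))
        return scored[:top_k]


index = TextIndex()
index.add("ethanol", "CCO")
index.add("propanol", "CCCO")
index.add("benzene", "c1ccccc1")
hits = index.search("CCO")
print(hits)
```

In this sketch the token lookup plays the role of the full-text engine, and the Tanimoto score plays the role of the relevance ranking; documents that share no token with the query are never scored.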
June 2018
Research Article | June 01 2018
A high-performance and flexible chemical structure & data search engine built on CouchDB & ElasticSearch
Ren-zhi Li; Bo-jie Li; Guo-zhen Zhang; Jun Jiang*
Hefei National Laboratory for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
Chin. J. Chem. Phys. 31, 341–349 (2018)
Article history
Received: November 06 2017
Accepted: December 25 2017
Citation
Ren-zhi Li, Bo-jie Li, Guo-zhen Zhang, Jun Jiang, Yi Luo; A high-performance and flexible chemical structure & data search engine built on CouchDB & ElasticSearch. Chin. J. Chem. Phys. 1 June 2018; 31 (3): 341–349. https://doi.org/10.1063/1674-0068/31/cjcp1711202