TY - JOUR
T1 - Large scale distributed data repository
T2 - Design of a molecular dynamics trajectory database
AU - Feig, Michael
AU - Abdullah, Matin
AU - Johnsson, Lennart
AU - Pettitt, B. Montgomery
N1 - Funding Information:
Vladimir A. Makarov is acknowledged for valuable discussions and suggestions. This work has in part been supported by the DoD Modernization program’s CHSSI component under contract TMC94-5801-0038-01 and by the National Partnership for Advanced Computational Infrastructure (NPACI) under cooperative agreement 10153082 with the University of Houston which is gratefully acknowledged.
PY - 1999/11
Y1 - 1999/11
N2 - The design of a molecular dynamics trajectory database is presented as an example of the organization of large-scale dynamic distributed repositories for scientific data. Large scientific datasets are usually interpreted through reduced data calculated by analysis functions. This allows a database architecture in which the analyzed datasets, that are kept in addition to the raw datasets, are transferred to a database user. A flexible user interface with a well defined Application Program Interface (API) allows for a wide array of analysis functions and the incorporation of user defined functions is a critical part of the database design. An analysis function is executed only when the requested analysis result is not available from an earlier request. A prototype implementation used to gain initial practical experiences with performance and scalability is presented.
AB - The design of a molecular dynamics trajectory database is presented as an example of the organization of large-scale dynamic distributed repositories for scientific data. Large scientific datasets are usually interpreted through reduced data calculated by analysis functions. This allows a database architecture in which the analyzed datasets, that are kept in addition to the raw datasets, are transferred to a database user. A flexible user interface with a well defined Application Program Interface (API) allows for a wide array of analysis functions and the incorporation of user defined functions is a critical part of the database design. An analysis function is executed only when the requested analysis result is not available from an earlier request. A prototype implementation used to gain initial practical experiences with performance and scalability is presented.
UR - http://www.scopus.com/inward/record.url?scp=0033340698&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0033340698&partnerID=8YFLogxK
U2 - 10.1016/S0167-739X(99)00039-4
DO - 10.1016/S0167-739X(99)00039-4
M3 - Article
AN - SCOPUS:0033340698
SN - 0167-739X
VL - 16
SP - 101
EP - 110
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
IS - 1
ER -