(A) Explore and Expand strategy used to index and collect MD-related files. In the explore phase, we search the respective data repositories for datasets that contain specific keywords …
(A) Distribution of files among MD simulation engines. (B) Breakdown of the ‘Unknown’ MD engine category from (A) into its 10 most frequently observed file types.
(A) Number of Gromacs-related files available in the searched data repositories. Files used for further analyses are shown in red. (B) Simple analysis of a subset of .xtc files, with the cumulative distribution of …
(A) Cumulative distribution of .mdp files versus simulation time for all-atom and coarse-grained simulations. (B) Sankey diagram of the distribution of values for thermostat and …
Data repository | datasets | first dataset | latest dataset | files | total size (GB) | zip files | files within zip | total files |
---|---|---|---|---|---|---|---|---|
Zenodo | 1011 | 19/11/2014 | 05/03/2023 | 20,250 | 12,851 | 1780 | 141,304 | 161,554 |
Figshare | 913 | 20/08/2012 | 03/03/2023 | 3336 | 736 | 590 | 74,720 | 78,056 |
OSF | 55 | 24/05/2017 | 05/02/2023 | 6146 | 495 | 14 | 0 | 6146 |
Total | 1979 | – | – | 29,732 | 14,082 | 2384 | 216,024 | 245,756 |
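
The "Total" row and the "total files" column can be re-derived from the per-repository rows. The following is a minimal sketch in Python with the values copied from the table; the variable names are illustrative, and the reading of "total files" as the sum of "files" and "files within zip" is an assumption that the reported numbers are consistent with.

```python
# Per-repository counts copied from the table above.
# Columns: datasets, files, total size (GB), zip files, files within zip.
rows = {
    "Zenodo": (1011, 20250, 12851, 1780, 141304),
    "Figshare": (913, 3336, 736, 590, 74720),
    "OSF": (55, 6146, 495, 14, 0),
}

# Column-wise sums, to compare against the "Total" row.
totals = tuple(sum(col) for col in zip(*rows.values()))

# Assumed definition: total files = files + files within zip.
total_files = {name: r[1] + r[4] for name, r in rows.items()}

print(totals)
print(total_files, sum(total_files.values()))
```

Under this reading, the sums match the reported totals: 1979 datasets, 29,732 files, 14,082 GB, 2384 zip files, 216,024 files within zip archives, and 245,756 files overall.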