Changes between Version 6 and Version 7 of Ticket #7358, comment 3


Ignore:
Timestamp:
Aug 2, 2022, 2:07:13 PM (3 years ago)
Author:
Tom Goddard

Legend:

Unmodified
Added
Removed
Modified
  • Ticket #7358, comment 3

    v6 v7  
    1414
    1515The index for the database is split across several files based on the amount of memory.  On crick with 376 GB of memory the 214 million sequence index is 6 files, with 4 of size 126 Mbytes, and for 100 million sequences have 5 files with two 118 Mbytes and one 52 MB.  On minsky the 100 million sequences has index with 11 files, 9  being 34 Mbytes and one at 52 Mbytes.
     16
     17The disk read speed on minsky is slower than I thought.  It is a SATA drive, Samsung 870 QVO 4 TB, and reads at only 500 Mbytes/sec.  I wrote some simple C code that gave 0.52 GB/sec reading the first 100 million sequences of AlphaFold database 44 GB in 84 seconds.  To read the 337 GB mmseqs2 index for first 100 million sequences would take 643 seconds at that speed.