File System FAQ

This document is intended as a quick reference on basic questions about the SAM-QFS file system on the HPCVL Sun Fire cluster. It includes information on disk quota, our tape library backup system, scratch space and archiving schemes.

  1. How is the /home file system structured?

  2. Are there disk quota?

  3. What to do with files that I use infrequently?

  4. What do I do if I need a lot of disk space?

  5. How is scratch space handled on the cluster?

  6. Which files are backed up, which aren't?

  7. I lost data, how can I get them back?

  8. How do the SAM file systems work and how are they configured on the Sunfire cluster?

  9. How can I find out which of my files are on disk, and which ones are on tape?

  10. How can I manage my files manually?

Answers:

  1. How is the /home file system structured?

    The /home file systems (there is actually two of them, but they are both called /home) is the main area where users of the Sun Fire cluster keep their data. Each user's home directory resides there, and is called /home/hpcXXXX where hpcXXXX denotes the user name.

    Physically, this file system resides in 3 racks containing approximately 160 TB of RAW disk space in the form of 46 Sun StorageTek 3410 FC units. The building blocks of the file system consist of many RAID 5 logical volumes later mapped as LUN's to the SAM-QFS host. This configuration is designed to tolerate the failure of multiple disks without the loss of data or disruption in service. This is achieved with the implementation of a "global spare" pool of disks. When a member drive from any of the logical drive fails, the global spare drive joings the logical drive and automatically starts to rebuild.

    Logically, the file system is a SAM-QFS file system which which is a feature of the Sun Performance Suite (QFS) and the Sun Utilization Suite (SAM-FS) storage management packages. The former provides fast and efficient file access on clusters, while the latter offers dynamic archiving capabilities in the form of a so-called "Hierarchical Storage Management" environment that allows a close integration of our StorEdge L1400 tape library with the disk file system. This allows the automatic archiving of files, and the dynamic management of various copies of user files residing on disk and on tape (see below, question 6).

    The present version of the QFS files system is 4.5.

    Back to Top...

  2. Are there disk quota?

    Unlike it predecessor (3.5), the 4.5 version of QFS supports disk quota. The increased number of HPCVL users on the system has made it necessary to introduce quota. At present these quota have been fixed at 100 GB per user for /home, which is enough for 9 out of 10 of our users. These quota pertain to the /home file system only. The /u1 file system has a quota of 2 TBytes to avoid uncontrolled fill-ups ("sanity quota"). Data that exceed these limits have to be moved, preferably off the system (see below question 4).

    Note that disk quota are enforced without exception. Once a user exceeds these quota, no further data can be written to the file system by that user, making it impossible to log in in some cases. If this happens, you need to contact us and arrange for freeing up disk space.

    Files in /home and /u1 are automatically managed by the SAM-QFS file system. This includes backups. Users do not have to do anything for these activities occur.

    Back to Top...

  3. What to do with files that I use infrequently?

    Experience shows that users who need large amounts of disk space use a great portion of their files rather infrequently. An example would be the trajectory files of a molecular dynamics run, or the results of large fluid dynamics simulations. Files that contain information which is only rarely accessed may be moved away from the /home file system into an alternative area denoted /u1. This file system offers almost 28 TB of storage and is not subject to disk quota.

    Data residing in /u1 are backed up by default. /u1 should be used by creating a directory with the user-ID as directory name, and then restricting access to this directory to the owner only.
    mkdir /u1/work/hpcXXXX
    chmod 700 /u1/work/hpcXXXX
    Of course, hpcXXXX stands for the user-ID and has to be substituted. The structure of the files and directories below /u1/work/hpcXXXX is left to the user.

    SAM-QFS offers the possibility of "releasing" files, i.e. they will still be accessible under /u1/work/hpcXXXX, but whenever it is accessed it will be retrieved from tape, since the file does not physically reside on disk anymore (see below question 6). At present, we do not release files.

    Back to Top...

  4. What do I do if I need a lot of disk space?

    If you need more disk space than the disk quota on /home allows, you should consider the following options, preferably in that order:

    1. The easiest way to keep data safe and with fast access is to keep them in /home. Our disk quota are comparably high, and the presence of more than 100 GB of data should lead a user to consider "cleaning up" the /home area. We strongly encourage all our users to download their permanent data and back them up on external media such as multiple drives, CD-Rom, Tapes, or DVD. This is the safest way to make sure your data cannot be lost. In many cases the only reason why disk space usage approaches the limit is that unprocessed data are kept around unnecessarily. We do not supply individual backups or archiving facilities for longterm storage. This is the responsibility of the user.
    2. If the data are only temporary (eg, they are results that serve as input for other computations, but can then be discarded), they might be written out on scratch space. See question 5 below for further details about scratch space. Data on scratch space will not be archived or backed up, and and should be removed as soon as they are not needed anymore. We have at this point approximately 12 TB of scratch space available on the system. This is only a solution if the data will be removed soon (within days) after being generated.
    3. If the data are needed for a longer period of time, but can eventually be deleted, they can be brought to the afore-mentioned /u1 file system (see previous question). Data of this kind will be backed up for the (rather unlikely) event of a multiple disk failure. We recommend to consider this solution for data that need to be available for several weeks, but cannot be kept in the /home area to avoid exceeding the quota. Note that a system-wide size limit for backing up a single file is 25 GByte. Files larger than that will not be backed up.
    4. Finally, if you need to keep large amounts of data that are accessed rather frequently, exceed the disk quota, and need to be backed up, you need to contact us and make special arrangements. Such arrangement might imply Category 2 membership in HPCVL, or even necessitate re-negotiating the membership conditions.

    Back to Top...

  5. How is scratch space handled on the cluster?

    Scratch space is supplied in the /scratch area of the file system. This space is intended for transitory data that are generated during a calculation and are usually deleted shortly after the calculation has finished. However, it is worthwhile to consider keeping intermediate results that are only needed for a few days on scratch space if there is a danger of exceeding disk quota in /home.

    Note that our scratch space is now global, i.e. accessible from all nodes. While this implies somewhat slower access than local scratch, it allows data to be used from different nodes within a program run (e.g. of an MPI program), and it simplifies maintanance.

    Scratch space is accessed via the /scratch directory. To use scratch space, do the following:

    1. Create a directory with your user-ID as a name under /scratch on all nodes:
      mkdir /scratch/hpcXXXX
    2. Change the file permissions on the directory to allow access only to you:
      chmod 700 /scratch/hpcXXXX
    Here, XXXX stands for the numbers in your user-ID. To use the scratch, you will often have to set an application specific environment variable, which can then be given the name /scratch/hpcXXXX to work on all nodes, eg. for the quantum-chemistry code Gaussian, one would set (in a csh):
    setenv GAUSS_SCRDIR /scratch/hpcXXXX
    Note that the above settings are automatically applied when you are given a user account.

    Back to Top...

  6. Which files are backed up, which aren't?

    HPCVL maintains backups and short-medium-term archives for the purpose of securing user data, not for permanent storage or external use. This means that it is the responsibility of the individual user to remove data that are to be kept permanently from the cluster and store them on external media, such as disks, tapes, CD's or DVD's.

    User data that reside in the /home file system are backed up on a 15 minute cycle for the first copy, and on a 60 day cycle for the second, i.e. keep two backup copies for the case of disk failure. Backup data kept on our L1400 tape library. Since /home is using a SAM-QFS file system, this archival happens automatically as soon as a file appears on the files system. Whenever a file changes, the change will be comitted to the archive as well.

    Data that reside in /u1 are also backed up. The scanning period for the first copy there is 1 hour, while the one for the second copy is 60 days, just as for the /home file system. No data residing either in /home or /u1 are released at this point.

    All data outside these two areas are not backed up. This holds specifically for /tmp and /scratch.

    Back to Top...

  7. I lost data, how can I get them back?

    The general answer is contact us. The system administrator will be able to retrieve the lost data from the regular backup on the L1400 tape library. Keep in mind that changes that you made to the data before the loss occured might be lost since the copy of your file might be outdated. If the loss is the consequence of a general disk failure, the part of the file system that was affected will be restored from safety backups, and it is not necessary (nor useful) to contact the administrator for the retrieval of individual files. In that case, you will have to wait until the file system is restored to normal. This may take several days in the case of a severe failure.

    Back to Top...

  8. How do the SAM file systems work and how are they configured on the Sunfire cluster?

    The SAM file system was designed to optimize the utilization of limited disk space and at the same time provide a dynamic way to do archives and backups. To this end they provide an interface between the files that reside directly on disk, and a tape library such as our L1400.

    The files system will automatically create copies of files that are freshly written to disk or that are altered. This is to safeguard from disk failures and serves as a standard backup. Additionally, the files system offers the possibility of making further copies on tape, and to keep files that have not been used for an extended period of time on tape altogether.

    The latter process is called releasing files. Data that have not been accessed for a certain time period may be copied to tape (preferably, more than one copy is made for safety), and then the copy on disk is replaced by a stub, which serves a similar function as inodes in Unix file systems, i.e. it contains information about where the data reside, their size, the date they have been accessed last, etc. However, the data themselves do not reside on disk anymore, and if they are to be accessed they will be retrieved automatically from tape. Often, the user will prefer to retrieve them manually before attempting to access them.

    Retrieving released data back to disk is called staging, and is either done automatically when the data are accessed, or preferably, by issuing a command beforehand. The staging process can take between minutes for a few small files that are on the same tape, when the archiving system is not busy, to several hours, when large data spread over several tapes are required at a time when several requests for tape operations are queued.

    The HPCVL Sunfire cluster has at this time two SAM file systems: /home (more accurately, /home1 and /home2), and /u1. Neither of these two systems is currently releasing any files. Here are the backup cycles for these areas:

    • /home: All files residing in the /home area, ie, in the user home directories, are handled by SAM. A first copy of the files is made every approximately 15 minutes after the file is created or altered. A second copy is produced every 60 days. No releasing takes place on /home, i.e. the data are always kept on disk. Note that /home is subject to disk quota of 100 GByte per user to make this policy possible.
    • /u1: Additional disk space is provided in the /u1 area. Note that data need to be placed in a specific directory: /u1/work/hpcXXXX. Data in /u1/work/hpcXXXX are handled as follows: The first copy is backed up on a one-hour cycle, whereas the second copy is on a 60-days cycle. No data are released at present. Note that a 2 TByte "sanity quota" is in effect on /u1.
    Note that the configuration of the SAM file systems might change in the future. Check this FAQ to keep up-to-date.

    Back to Top...

  9. How can I find out which of my files are on disk, and which ones are on tape?

    None of the files on our file system is currently subject to releasing, i.e. all file physically reside on disk.

    Back to Top...

  10. How can I manage my files manually?

    Many file operations such as releasing and staging are done by the SAM file system automatically, and have been configured by the system administrator. Certain basic settings (such as releasing files within some file systems) cannot be overridden by users, to maintain the integrity of the file system.

    There are commands that let the user control their own files to some degree. However, at present, these commands cannot be accessed from the login zone. This may change in the future. Please check this FAQ file occassionally for updates.

    If you are unsure about the state of your files, contact us .

    Back to Top...

 
 
   
© HPCVL 2007