Nearline storage

Nearline storage (a portmanteau of "near" and "online storage") is a term used in computer science to describe an intermediate type of data storage that represents a compromise between online storage (supporting frequent, very rapid access to data) and offline storage/archiving (used for backups or long-term storage, with infrequent access to data).[1][2]

Both archiving and nearline allow a reduction of database size that results in improved speed of performance for the online system. However, accessing archived data is more complex and/or slower than is the case with nearline storage, and can also negatively affect the performance of the main database, particularly when the archive data must be reloaded into that database.[3]

Nearline storage dates back to the IBM 3850 Mass Storage System (MSS) tape library, which was announced in 1974.[4]

Overview

The formal distinction between online, nearline, and offline storage is:[4]

For example, always-on spinning hard disk drives are online storage, while spinning drives that spin down automatically, such as in massive arrays of idle disks (MAID), are nearline storage. Removable media such as tape cartridges that can be automatically loaded, as in tape libraries, are nearline storage, while tape cartridges that must be manually loaded are offline storage.

Robotic nearline storage

A large tape library, with tape cartridges placed on shelves in the front, and a robotic arm moving in the back. Visible height of the library is about 180 cm.

The nearline storage system knows on which volume (cartridge) the data resides, and usually asks a robot to retrieve it from this physical location (usually: a tape library or optical jukebox) and put it into a tape drive or optical disc drive to enable access by bringing the data it contains online.[4] This process is not instant, but it only requires a few seconds.

Nearline tape and optical storage has the advantage of relatively longer lifespans compared to spinning hard drives, simply due to the storage media being idle and usually stored in protected dust-free enclosures when not in use. In a robotic tape loading system, the tape drive used for accessing data experiences the most wear and may need occasional replacement, but the tapes themselves can last for years to decades. If there are sealable access doors between the access mechanism and the media, it is possible for the idle media storage enclosure to survive fire, floods, lightning strikes, and other disasters.

Hard disk drive nearline storage

MAID (massive array of idle drives) systems archive data in an array of hard disk drives, with the most drives in a MAID usually stopped. The MAID system spins up each drive on demand when necessary to read (or in some cases to write) data on that drive. For a given amount of storage capacity, MAID systems have higher densities and lower power and cooling requirements than "hot" storage systems that keep all the disks spinning at full speed at all times.

Some hard drive and storage systems vendors and suppliers use the term in reference to low-rotational speed hard drives, that are built to be more reliable than generic desktop and laptop computer hard drives, and are intended to be operational continuously for 24 hours a day, seven days a week, possibly for several years.

Nearline hard drives may be used in personal or small business network-attached storage (NAS) systems, or as non-critical moderate-performance data storage on servers, where greater durability is required for the drive to operate continuously.

By comparison, standard hard drives are assumed to only be in operation for a few hours each day, and are not spinning when the computer is either turned off or in sleep mode. Standard hard drives may also use data caching methods that can improve single-drive performance, but would interfere with the operation of multi-drive RAID storage systems, potentially causing data loss or corruption.

Specifically the term nearline hard drive is being used to refer to high-capacity Serial ATA drives that work with Serial Attached SCSI storage devices. Presumably this usage is by analogy to the high-capacity and low-access speed tape systems.[5]

References

  1. "Nearline storage" in "A Glossary of Archival and Records Terminology". Retrieved on 2009-01-30.
  2. Venkatramani, Chitra and Tzi-cker Chiueh (1993). "Survey of Near-Line Storage Technologies: Devices and Systems". Experimental Computer Systems Laboratory.
  3. Ritchie, Arthur (2008), "Nearline and archiving in the data warehouse: what's the difference?". Database and Network Journal, October issue. Retrieved on 2012-08-28.
  4. 1 2 3 Pearson, Tony (2010). "Correct use of the term Nearline.". IBM Developerworks, Inside System Storage. Retrieved 2015-08-16.
  5. Seagate Technology Paper TP-543 (2005), . Retrieved on 2012-08-28.
This article is issued from Wikipedia - version of the Friday, December 18, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.