In computer storage, a NetApp filer is a storage system product by NetApp. It can serve storage over a network using file-based protocols such as NFS, SMB, FTP, TFTP, and HTTP. Filers can also serve data over the block-based protocol FCP, using a Fiber channel, Fibre Channel over Ethernet (FCoE), or iSCSI transport layer.
The product is also known as NetApp Fabric-Attached Storage (FAS) and NetApp All Flash FAS (AFF)
NetApp Filers implement their physical storage in large disk arrays.
While most large-storage filers are implemented with commodity computers with an operating system such as Microsoft Windows Server, VxWorks or tuned Linux, NetApp filers use highly customized hardware and the proprietary Data ONTAP operating system with WAFL file system, all originally designed by NetApp founders David Hitz and James Lau specifically for storage-serving purposes. Data ONTAP is NetApp's internal operating system, specially optimised for storage functions at high and low level. It boots from FreeBSD as a stand-alone kernel-space module and uses some functions of FreeBSD (command interpreter and drivers stack, for example).
All filers have battery-backed non-volatile random access memory or NVDIMM, referred to as NVRAM or NVDIMM, which allows them to commit writes to stable storage more quickly than traditional systems with only volatile memory. Early filers connected to external disk enclosures via parallel SCSI, while modern models (as of 2009) use fibre channel and SAS (Serial Attach SCSI) SCSI transport protocols. The disk enclosures (shelves) use fibre channel hard disk drives, as well as parallel ATA, serial ATA and Serial attached SCSI. Starting with AFF A800 NVRAM PCI card no longer used for NVLOGs, it was replaced with NVDIMM memory directly connected to memory bus.
Implementers often organize two filers in a high-availability cluster with a private high-speed link, either Fibre Channel, InfiniBand, 10 Gigabit Ethernet, 40 Gigabit Ethernet or 100 Gigabit Ethernet. One can additionally group such clusters together under a single namespace when running in the "cluster mode" of the Data ONTAP 8 operating system.
Video NetApp filer
Internal architecture
Modern NetApp filers consist of customized computers with Intel processors using PCI. Each filer has non-volatile random access memory, called NVRAM, in the form of a proprietary PCI NVRAM adapter or NVDIMM-based memory, to log all writes for performance and to play the data log forward in the event of an unplanned shutdown. One can link two filers together as a cluster, which NetApp (as of 2009) refers to using the less ambiguous term "Active/Active".
Maps NetApp filer
Hardware
Each filer model comes with a set configuration of processor, RAM, and non-volatile memory, which users cannot expand after purchase. With the exception of some of the entry point storage controllers, the NetApp filers have at least one PCIe-based slot available for additional network, tape and/or disk connections. In June 2008 NetApp announced the Performance Acceleration Module (or PAM) to optimize the performance of workloads which carry out intensive random reads. This optional card goes into a PCIe slot and provides additional memory (or cache) between the disk and the filer RAM and system memory, thus improving performance.
All Flash FAS
Also known as AFF A-series. Usually AFF systems based on the same hardware as FAS but first one optimized and works only with SSD drives on back end, for example AFF A700 & FAS9000, A300 & FAS8200, A200 & FAS2600, A220 & FAS2700 use the same hardware but AFF systems does not include Flash Cache cards. Also AFF systems does not support FlexArray third party storage array virtualization functionality. Both AFF & FAS using same firmware image and nearly all noticeable functionality for end user are the same for both storage systems. However internally data processed and handled differently in ONTAP on AFF systems for example used different Write Allocation algorithms then on FAS systems. Because AFF systems have faster underlying SSD drives Inline data deduplication in ONTAP systems nearly not noticeable (about 2% performance impact on low end systems).
Storage
NetApp uses either SATA, Fibre Channel, SAS or SSD disk drives, which it groups into RAID (Redundant Array of Inexpensive Disks or Redundant Array of Independent Disks) groups of up to 28 (26 data disks plus 2 parity disks). NetApp FAS storage systems which contain only SSD drives with installed SSD-oprimzed ONTAP OS called All Flash FAS (AFF).
Disks
FAS and AFF filers are using enterprise level HDD and SSD (i.e. NVMe SSD) drives with two ports, each port connected to each controller in an HA pare. HDD and SSD drives can be bought only from NetApp and installed in NetApp's Disk Shelves for FAS/AFF platform. Psychical HDD and SSD drives, partitions on them and LUNs imported from third party arrays with FlexArray functionality considered in ONTAP as a Disk. In SDS systems like ONTAP Select & ONTAP Cloud a logical block storage like virtual disk or RDM inside ONTAP also considered as a Disk. Do not confuse general term "disk drive" and "disk drive term used in ONTAP system" because with ONTAP it could be entire physical HDD or SSD drive, an LUN or a partition on a physical HDD or SSD drive. LUNs imported from third party arrays with FlexArray functionality in HA pair configuration must be accessible from both nodes of the HA pair. Each disk have ownership on it to show which controller own and serve the disk. An Aggregates can include only disks owned by a single node, therefore each aggregate owned by a node and any upper objects as FlexVol volumes, LUNs, File Shares are served with a single controller. Each controller can have its own disks and aggregates an serve them where both nodes can be utilized simultaneously even though they not serving the same data.
RAID
RAID and WAFL in ONTAP systems tightly integrated. There are few RAID types available with NetApp FAS / AFF systems: RAID-4 with 1 dedicated parity drive, allows any 1 drive to fail in a RAID group; RAID-DP US patent 7409625 with 2 dedicated parity drives, allows any 2 drives to fail simultaneously in a RAID group, RAID-TEC US patent 7640484 with 3 dedicated parity drives, allows any 3 drives to fail simultaneously in a RAID group . RAID-DP similar to RAID-6 because have same resiliency of 2 disk drives but all the NetApp's RIADs have dedicated parity disks and with combination of NetApp implementation of non-volatile memory and WAFL characteristic to allays write to new place dedicated parity disks are never bottlenecks compare to traditional RAID-4 and RAID-6 on write/rewrite operations. Each aggregate consist of one or two plexes, an plex consists of one or more RAID groups. Typical NetApp FAS or AFF storage system have only 1 plex in each aggregate, two plexes used in local SyncMirror or MetroCluster configurations. Each RAID group consists of disk drives of same type, speed, geometry and capacity. Though NetApp Support could allow a user to install a drive to an RAID group with same or bigger size and different type, speed and geometry for temporary basis. Ordinary data aggregates if containing more than one RAID group must have same RAID groups across the aggregate, same RAID group size is recommended, but NetApp allows to have exception in last RAID group and configure it as small as half of the RAID group size across aggregate. For example such an aggregate might consists of 3 RAID groups: RG0:16+2, RG1:16+2, RG2:7+2. Aggregates enabled as FlshPool and with both HDD and SSD drives called hybrid aggregates. In Flash Pool hybrid aggregates same rules applied to the hybrid aggregate as to ordinary aggregates but separately to HDD and SSD drives, thus it is allowed to have two different RAID types: only one RAID type for all HDD drives and only one RAID type for all SSD drives in a single hybrid aggregate. For example SAS HDD with RAID-TEC (RG0:18+3, RG1:18+3) and SSD with RAID-DP (RG3:6+2). NetApp filers combine underlying RAID groups similarly to RAID-0. Also in NetApp FAS systems with FlexArray feature third party LUNs could be combined in a Plex similarly to RAID-0. NetApp filers systems can be deployed in MetroCluster and SyncMirror configurations which are using technique comparably to RAID-1 with mirroring data between two plexes in an aggregate.
Flash Pool
NetApp Flash Pool is a feature on hybrid NetApp FAS systems allows create hybrid aggregate with HDD drives and SSD drives in a single data aggregate. Both HDD and SSD drives form separate RAID groups. Since SSD used also write operations it require RAID redundancy contrary to Flash Cache but allows to use different RAID types for HDD and SSD for example it is possible to have 20 HDD 8TB in RAID-TEC while 4 SSD in RAID-DP 960GB in a single aggregate. SSD RAID used as cache and improve performance for read-write operations for FlexVol volumes on the aggregate where SSD added as cache. Flash Pool cache similarly to Flash Cache have policies for read operations but also include write operations which could apply separately for each FlexVol volume located on the aggregate, thus could be disabled on some volumes wile others could benefit from SSD cache. To enable an aggregate with Flash Pool technology minimum 4 SSD disks required (2 data, 1 parity and 1 hot spare), it is also possible to use ADP technology to partition SSD into 4 pieces (Storage Pool) and distribute those pieces between two controllers so each controller will benefit from SSD cache when there is small amount of SSD. Flash Pool is not available with FlexArray and is available only with NetApp FAS native disk drives in NetApp's disk shelves.
FlexArray
FlexArray is NetApp FAS functionality allows to visualize third party storage systems and other NetApp storage systems over SAN protocols and use them instead of NetApp's disk shelves. With FlexArray functionality RAID protection must be done with third party storage array thus NetApp's RAID-4, RAID-DP and RAID-TEC not used in such configurations. One or many LUNs from third party arrays could be added to a single aggregate similarly to RAID-0. FlexArray is licensed feature.
Security
ONTAP OS have number of features to increase security on the storage system like Onboard Key Manager, passphrase for controller boot with NSE & NVE encryption and USB key manager (available starting with 9.4).
NetApp Storage Encryption
NetApp Storage Encryption (NSE) is using specialized purpose build disks with low level Hardware-based full disk encryption (FDE/SED) feature, compatible nearly with all NetApp ONTAP features and protocols but does not offer MetroCluster. NSE feature does overall nearly zero performance impact on storage system. NSE feature similarly to NetApp Volume Encryption (NVE) in filers can store encryption key locally in Onboard Key Manager or on dedicated key manager systems using KMIP protocol like IBM Security Key Lifecycle Manager and SafeNet KeySecure. NSE is data at rest encryption which means it protects only from physical disks theft and does not give additional level of data security protection in normally operational and running system.
PAM / Flash Cache
NetApp Filer can have PAM ( Performance Accelerate Module ) or Flash Cache (PAM II) which can reduce read latencies and allows the filer to process more read intensive work without adding any further disk to the underlying RAID since read operations do not require redundancy in case of Flash Cache failure. Flash Cache works on controller level and accelerate only read operations. Each separate volume on the controller can have different caching policy or read cache could be disabled for a volume. Flash Cache caching policies applied on FlexVol level. Flash Cache technology compatible with FlexArray feature.
MetroCluster
MetroCluster (MC)s free functionality for FAS and AFF systems for metro high availability with synchronous replication between two sites, this configuration require additional equipment. MetroCluster uses SyncMirror and plex technique where on one site number of disks form one or more RAID groups aggregated in a plex, while on the second site have same number of disks with same type and RAID configuration. One plex synchronously replicates to another in compound with non-volatile memory. Two plexes form an aggregate where data stored and in case of disaster on one site second site provide read-write access to data. MetroCluster Support FlexArray technology. MetroCluster configurations are possible only with mid-range and high-end models which provide ability to install additional network cards required to MC to function.
Clustered MetroCluster
With MetroCluster it is possible to have one or more storage node per site to form a cluster or Clustered MetroCluster (MCC). Remote and local HA perter node must be same model. MCC consists of two clusters each located on one of two sites. There may be only two sites. In MCC configuration each one remote and one local storage node form Metro HA or Disaster Recovery Pare (DR Pare) across two sites while two local nodes (if there is partner) form local HA pare, thus each node synchronously replicates data in non-volatile memory two nodes: one remote and one local (if there is one). It is possible to utilize only one storage node on each site (two single node clusters) configured as MCC. 8 node MCC consists of two clusters - 4 node each (2 HA pare), each storage node have only one remote partner and only one local HA partner, in such a configuration each site clusters can consists out of two different storage node models. For small distances MetroCluster require at least one FC-VI or newer iWARP card per node. FAS and AFF systems with ONTAP software versions 9.2 and older utilize FC-VI cards and for long distances require 4 dedicated Fibre Channel switches (2 on each site) and 2 FC-SAS bridges per each disk shelf stack, thus minimum 4 total for 2 sites and minimum 2 dark fiber ISL links with optional DWDMs for long distances. Data volumes, LUNs and LIFs could online migrate across storage nodes in the cluster only withing a single site where data originated from: it is not possible to migrate individual volumes, LUNs or LIFs using cluster capabilities across sites unless MetroCluster switchover operation is used which disable entire half of the cluster on a site and transparently to it's clients and applications switch access to all of the data to another site.
MetroCluster over IP
Starting with ONTAP 9.3 MetroCluster over IP was introduced with no need for dedicated back-end Fibre Channel switches, FC-SAS bridges and dedicated dark fiber ISL. MetroCluster over IP require Ethernet cluster switches with installed ISL and utilize iWARP cards in each storage controller for synchronous replication.
Data ONTAP OS
NetApp filers using proprietary OS called ONTAP (Previously Data ONTAP). Main purpose for OS in a storage system is to serve data to clients in non-disruptive manner with data protocols like CIFS, NFS, iSCSI, Fiber Channel, NVMe over Fabrics (NVMe-oF, currently only FC-NVMe supported) and to provide enterprise features like High Availability, Disaster Recovery and data Backup. ONTAP OS provide enterprise level data management features like FlexClone, SnapMirror, SnapLock, MetroCluster etc, most of them snapshot-based WAFL File System capabilities.
WAFL File System
WAFL, as a robust versioning filesystem in NetApp's proprietary OS ONTAP, it provides snapshots, which allow end-users to see earlier versions of files in the file system. Snapshots appear in a hidden directory: ~snapshot
for Windows (SMB) or .snapshot
for Unix (NFS). Up to 255 snapshots can be made of any traditional or flexible volume. Snapshots are read-only, although ONTAP provides additional ability to make writable "virtual clones", based at "WAFL snapshots" technique, as "FlexClones".
ONTAP implements snapshots by tracking changes to disk-blocks between snapshot operations. It can set up snapshots in seconds because it only needs to take a copy of the root inode in the filesystem. This differs from the snapshots provided by some other storage vendors in which every block of storage has to be copied, which can take many hours.
7MTT
Each filer running Data ONTAP 8 could switch between modes either 7-Mode or Cluster mode. In reality each mode was a separate OS with its own version of WAFL, both 7-mode and Cluster mode where shipped on a single firmware image for filers till 8.3 where 7-mode was deprecated. It is possible to switch between modes on a filer but all the data on disks must be destroyed first since WAFL is not compatible and server-based application called 7MTT tool was introduced to migrate data from old 7-mode filers to new Cluster-Mode filers:
- With SnapMirror based replication called Copy-based transition which helped to migrate all the data with planned downtime using only storage vendor capabilities. Copy-based transition require new controllers and disks with space no less than on source system if all the data to be migrated. Both SAN and NAS data are possible.
- Starting with 7-mode 8.2.1 and Cluster-Mode 8.3.2 WAFL compatibility where introduced and new feature in 7MTT tool called Copy-free transition to replace old controllers running 7-mode with new controllers running Cluster-Mode and planned downtime, while new system require additional system disks with root aggregates for new controllers (it could be as less as 6 disks). Since with Copy-free transition no data copying required 7MTT tool helping only for new controllers reconfiguration. Both SAN and NAS data conversion supported.
Additional to 7MTT there are two other paths to migrate data based on protocol type:
- SAN data could be copied with foreign LUN import (FLI) functionality integrated in NetApp filer systems which can copy data over SAN protocol while new filer placed as SAN proxy between hosts and old storage system which require host reconfiguration and minimum downtime. FLI available as for old 7-mode systems and for some models of storage systems of competitors.
- NAS data could be copied with NetApp XCP free host-based utility thus host-based copy process processed with the utility from any copying data from source server with SMB or NFS protocols to ONTAP system with minimal downtime for client systems reconfiguration for new NAS server.
Previous limitations
Prior to the release of ONTAP 8, individual aggregate sizes were limited to a maximum of 2TB for FAS250 models and 16TB for all other models.
The limitation on aggregate size, coupled with increasing density of disk drives, served to limit the performance of the overall system. NetApp, like most storage vendors, increases overall system performance by parallelizing disk writes to many different spindles (disk drives). Large capacity drives, therefore limit the number of spindles that can be added to a single aggregate, and therefore limit the aggregate performance.
Each aggregate also incurs a storage capacity overhead of approximately 7-11%, depending on the disk type. On systems with many aggregates this can result in lost storage capacity.
However, the overhead comes about due to additional block-checksumming on the disk level as well as usual file system overhead, similar to the overhead in file systems like NTFS or EXT3. Block checksumming helps to insure that data errors at the disk drive level do not result in data loss.
Data ONTAP 8.0 uses a new 64bit aggregate format, which increases the size limit of FlexVolume to approximately 100TB (depending on storage platform) and also increases the size limit of aggregates to more than 100 TB on newer models (depending on storage platform) thus restoring the ability to configure large spindle counts to increase performance and storage efficiency. ([1])
Model history
This list may omit some models. Information taken from spec.org, netapp.com and storageperformance.org
EOA = End of Availability
SPECsfs with "*" is clustered result. SPECsfs performed include SPECsfs93, SPECsfs97, SPECsfs97_R1 and SPECsfs2008. Results of different benchmark versions are not comparable.
See also
- Network attached storage
- NetApp
- ONTAP Operation System, used in NetApp storage systems
- Write Anywhere File Layout (WAFL), used in NetApp storage systems
References
External links
- Storage Filer (definitions)
- SnapLock Technical Report
- NetApp training videos
- NETWORK-APPLIANCE (Mib file)
- NetApp end of availability information
Source of the article : Wikipedia