Uploaded by Jordan Gillette

Last Research Paper - Jordan Gillette

advertisement
Gillette 1
Jordan Gillette
Mrs. Grogan
English 101
05 May 23
The Future of Digital Data Storage
As we progress into the digital age, our demands for digital data storage are mounting.
The demand for more storage is quickly expanding even in our own homes with higher
resolutions in pictures and videos. There are many solutions to this problem, two of which are
cloud storage, which many are already familiar with or a personal home NAS (network-attached
storage). A NAS is a specialized computer designed solely for storage and sharing data with
whoever is allowed access to it like family, friends, or on a larger scale, even a whole company.
With increasing file sizes and the demand for data storage, there is a need for an online storage
method. A home NAS is an excellent solution to that problem, with an excellent price-to-size
ratio and robust redundancy.
We must first discuss the different types of drives to go deeper into what a NAS is. A
drive is a physical device that stores data, and it is what you plug into your computer to read and
write information. There are several different types of drives used in the modern day, including
SATA (serial advanced technology attachment), HDD (hard disk drive), SATA SSD (solid-state
drive), and an M.2 NVMe (nonvolatile memory express) drive.
SATA and M.2 are standards for attaching drives to the motherboard. SATA is a standard
that uses a cable to connect the drive to the motherboard. It is a cheaper solution, but it has some
drawbacks as they are limited to the speed that they can go. In contrast, M.2 is a much faster
standard that connects directly to the motherboard using PCIe X4. This type of connection has
Gillette 2
direct leads connecting it to the CPU, vastly outperforming any SATA drive (Amsbarra). The fact
that it directly connects to the motherboard making it so fast also makes it a bad option for most
NASs as there are very few spots for NVMes on the motherboard.
The data can be physically stored on a drive in two ways: an HDD or an SSD; NVMe is
technically a form of SSD. An HDD uses a physical disk that spins like a CD. It has a
read-and-write head gliding over those disks, and the data is stored using magnetic forces. With
the moving parts, the lifespan of these drives is measured by a certain number of reads and
writes, for a high end one that is about 600,000 load unload cycles, not a specific amount of
time. On the other hand, the SSD has no moving parts and instead works by using electricity to
flip the charge of gates in NAND chips. With no moving parts, its lifespan is measured by the
number of years until the materials break down and stop holding charges accurately.
The final comparison of these drives deals with speed. NVMe is the fastest option, but
there are better options for logistical reasons, as stated before. For a NAS, SATA, and SSDs are
better options. An SSD is much faster than an HDD, but it also comes at a higher price. “Average
SSD Price Compared to HDD − 3x-5x higher cost” (Mohammadamin). When it comes to a NAS,
you probably would not have the HDDs spinning at all times, so you would have to factor in a
spin-up time when you start using it. It can add several seconds of latency, an essential factor in
choosing a drive.
Within SATA drives there are two different form factors. There is a smaller version that is
2.5 inches. “the 2.5-inch SATA drive and is most often used in laptop computers and small form
factor desktops”(Amsbarra). They are great for many applications, but for a NAS the physical
size is not a huge concern as the main focus is how much data they can hold. The lager size is the
3.5 inch drive. “the 3.5-inch SATA drive and these are used in all mid- and full-size desktop
Gillette 3
computers.” (Amsbarra). This size of drive tends to hold more data and because of that it is the
better choice for a home NAS.
The next step to understanding how NAS works is understanding how we combine all of
the drives that we have in the system into one single storage device. We achieve this through
RAID (Redundant Array of Independent Disks). There are many different types of RAID. You
have RAID 0-7, 10, 50, and 60. For this paper, we do not need to go into RAID 10, 50, and 60 as
they are just combinations of other RAIDs and are more focused on an enterprise market than a
consumer. We also do not need to go into RAID 2, 3, and 4 as they do not apply to a modern
NAS. RAID can combine these drives through software housed in the BIOS (basic input/output
system). The BIOS is the OS (operating system) of the motherboard. It is always running under
your computer’s regular OS. Because it runs under the regular operating system, it can grab,
combine, and manage all the different singles from the drives. It shows one combined storage
device to the regular OS, which you would see while using the computer.
Within the RAID there are several different types. Some of the important ones are RAID
0, 1, 4, 5, and 6. Below is a chart that represents the different types. A good analogy is where we
represent each drive as a piece of paper, and the data as words being written on that paper.
Gillette 4
For RAID 0, imagine you take all your
papers and line them up side by side.
Then you would start writing on the
first line of the first page. When you get
to the end of the first line, instead of
going to the second line of page 1, you
go to the first line of page 2. When you
get to the end of the first line of page 2,
then you go on to page 3, and so on till
you get to your last page. When you hit
the last page, you go on to the second
line of page 1 and repeat. This general
concept, done with data and drives, not
(Chen et al.)
paper, is called striping.
Striping is used in most types of RAID, but RAID 0 is solely striping. RAID 0 is
high-speed, as you can write on all the drives simultaneously. With this speed comes a
considerable cost; if one of your drives fails, you lose all of the data on all of them because you
lose the critical context of what connects the data between the drives. RAID 1, on the other hand,
is like RAID 0 in how you strip the data across the drives, but you have a second set of drives
that has an exact copy of all of the data. You can see this represented by the diagram above,
where the white is the original drive, and the gray is the copies. This setup is highly redundant,
which means you can lose drives and not necessarily lose data. You can lose any one drive to
failure and sometimes even more, but that is not a guarantee. This style also comes at a
Gillette 5
significant cost; you lose half of the storage space of the drives as usable capacity, so it is
inefficient.
Another type is RAID 4. For this one, envision lining up all your papers and writing
across them just like you did for RAID 0. However, when you get to the last page, instead of
writing across the first line of the last page, you write a quick summary of what you just wrote on
the other first lines. Then you move to the second line of page 1. If you look at the diagram
above, you can see how that would be, with all the white being the regular data and the gray ones
being the summary, also called the parity data. The parity data allows for redundancy where you
can lose any one drive without losing everything, without losing as much of your capacity.
RAID 5 is very similar to RAID 4 in how it has striped data, with one drive per line being
reserved for parity data. However, in RAID 5, which drive is responsible for the parity data
changes line by line. As you can see in the diagram, the dive that has the parity data is constantly
changing. This provides some slight effects and benefits over RAID 4 in terms of storage
capacity and speed of rebuilding the RAID if there is a drive failure. Another advantage of it is
“using the RAID5 technique can reduce the page error rate compared with ECC” (Seung-Ho and
Ki-Woong). RAID 5 is superior to RAID 4 in all ways.
RAID 6 is the final RAID we will review in this paper. It is similar to RAID 5 in that it
has parity data and that data is rotated throughout the different drives. It differs because you
reserve two drives per line for parity data, each containing the same parity data. With this
system, you can lose any two drives and still have enough data to rebuild the entire data set
without loss. However, this comes at a slight speed and storage cost over RAID 5. Both are great
options for a home NAS.
Gillette 6
There are a couple different ways to set up a home NAS. There are two main options for
looking at a home NAS. You can either make one yourself or buy a prebuilt one. Building one
yourself requires a fair degree of technical know-how, but you can end up with a system that is
far more specialized to your needs and far cheaper, sometimes several hundred dollars cheaper.
When you buy a prebuilt, it is far more time effective, requiring less work and menace on your
part. Both are great solutions for a home NAS. It depends on your situation which is better.
When comparing a NAS to cloud storage, both have advantages and disadvantages. The
price per storage for cloud storage is quite good, but as the amount of storage you require, a
home NAS will quickly perform it. You can buy a Synology drive with 8Tb of storage for 500$
on Amazon while getting just 5Tb with google drive will cost 250$ a year. In just two years,
buying your own NAS makes more sense. The main disadvantage to a home NAS is that “If two
or more users are modifying the same file, they are doing so to their private copies of it. Neither
sees the other’s changes and the last one to close the file will upload the changes, overwriting the
other user’s version.” (Krzyzanowski). This is not an issue if you are storing a bunch of photos
and videos or if you are the only one to use it, but if you have many documents that multiple
people will go and edit, then a cloud solution like google drive is far better.
A NAS is an excellent method of storing data, but it is not the right solution for everyone.
If you have significant storage needs and do not have demands for concurrent editing, then there
is nothing better than a home NAS.
Gillette 7
Works Cited
Amsbarra. “Types of Hard Drives - Engineering Information Technology Help.” Engineering
Information Technology Help -, 30 July 2020,
it.engr.ncsu.edu/help/kb/types-of-hard-drives/.
Chen, Peter M., et al. “RAID: High-Performance, Reliable Secondary Storage.” AMC Computer
Surveys, www.pdl.cmu.edu/PDL-FTP/RAID/computin.pdf. Accessed 12 May 2023.
Krzyzanowski, Paul. “Network Attached Storage Distributed File Systems.” Network Attached
Storage, Mar. 2021, https://people.cs.rutgers.edu/~pxk/417/notes/nas.html.
Lim, Seung-Ho, and Ki-Woong Park. “Compression-Assisted Adaptive ECC and RAID
Scattering for NAND Flash Storage Devices.” Sensors (14248220), vol. 20, no. 10, May
2020, p. 2952. EBSCOhost, https://doi.org/10.3390/s20102952.
Mohammadamin , Ajdari. “A Scalable System Design for Data Reduction in Modern Storage
Servers.” Sharif University of Technology, 22 Jan. 2020,
sharif.edu/~kharrazi/data_talks/slides/data-talks-ajdari-98-11-2-slides.pdf.
Download