Consolidating for the Future - SSDs and Big Data - Samsung For Partners
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
WHITEPAPER Consolidating for the Future SSDs and Big Data This white paper will explore the impact of big data in various key fields and the important role solid state drives (SSDs) are to play in the future of data management, most notably the higher performance and cost savings offered over hard disk drives (HDDs) in the data center. Through consolidation, resources are used more efficiently, allowing an organization to better prepare for the future. Introduction “Big data is the new oil” is a popular phrase these days. From 2015 to 2019, big data and business analytics As the discovery of oil fields changed industries in the application sales are expected to increase from around 1800s, so too does the implementation of big data $122 billion to more than $187 billion, an increase affect lives across sectors today. There’s simply more of over 50%.1 The market is driven to collecting and information being collected than we know what to do collating data sets, a trend that is likely to increase with, and technological bottlenecks limit how much with time and technological improvements. and how quickly we can process the data. As hardware capabilities improve, however, we move ever further Big data is happening whether we prepare for it or not. into an era where big data shapes how our computers Upgrading data centers to optimize performance will and machines interact with us, and how they will not only help facilitate the influx of more data, but shape our lives. also yield cost savings for organizations.
SSDs are the future for data centers handling big data Solid-state disk drive (SSD) shipments from 2012 With torise the 2017 (in data of big million units) comes increased In 2013, SSD shipments totaled at 57 million units. requirements for processing the information, and By 2017, this number is expected to reach 189.6 200 189.6 data centers equipped with SSDs are the future. million units, a 333% increase. SSD purchases are on SSD implementation isn’t a new but growing trend. the rise. In five years, the market for enterprise SSDs is expected to be approximately 7.5 times larger than 150 SSDs need little introduction. Unlike HDDs which it is today.2 Unit shipments in millions store information electromagnetically, SSDs store data on interconnected flash memory chips. The The increase in both sales and shipments of 100 absence of moving mechanical parts in SSDs enterprise SSDs worldwide exhibit how more results in less seek time for data, giving almost organizations are opting to invest in SSD technology. instantaneous39 access. As such,57there’s a lower chance And the technology is rapidly advancing: from 50 for mechanical failure, whereas HDDs are composed the first quarter of 2015 to the first quarter of of a read/write head, an actuator and a spindle 2016, average SSD capacity grew approximately around which the platter revolves. HDDs were once 33 percent.3 Price-per-GB has declined with the 0 the most common 2012 form of drive 2013on the market. 2017* emergence of NAND SSD, leading the way to 3D Today this is changing. NAND, where NAND storage layers are stacked to increase density and thus capacity. By early 2017, revenues from enterprise SSDs will exceed revenues from HDDs worldwide. By 2019, SSDs have, in the past, been regarded as costlier with revenues from SSDs is forecasted to exceed that limited capacities, but in the past three years, SSD of HDDs by $3.811 million. retail prices will have dropped from $1 per gigabyte Revenues from enterprise hard and solid-state disk (HDD/SSD) drives worldwide from 2012 to 2019 (in million U.S. dollars) 17,500 15,356 15,000 13,515 11,475 12,500 11,036 11,374 11,545 Revenue in million U.S. Dollars 10,420 9,707 10,000 8,897 8,917 9,181 7,463 9,418 7,500 5,771 4,429 5,000 3,022 2,500 0 2012 2013 2014 2015* 2016* 2017* 2018* 2019* HDD SSD Consolidating for the Future | 2
Solid-state disk drive (SSD) shipments from But what does all this have to do with handling large 2012 to 2017 (in million units) amounts of data? Higher capacity and higher density drives are necessary for data centers of the future 200 189.6 if they are to handle big data. Where SDDs outshine HDDs is also its greatest gift to data processing: higher IOPS and lower latency. One enterprise SSD 150 can replace several HDDs, as we will see in this Unit shipments in millions paper, offering higher performance power in very small packages. 100 50 39 57 SSDs outperform HDDs 0 2012 2013 2017* Performance is a major draw to using SSDs in the data center. SSDs have an I/O latency of only 3 to 5 in 2013 to $0.23 per gigabyte in 2017. While this 3 microseconds by utilizing flash memory at speeds is still more expensive than the 4 to 5 cents per up to 3000 times faster than HDDs.4 With regards gigabyte it costs for HDD storage, it’s important to performance, one SSD is worth as many as 20 to note that price-per-GB should not be the sole HDDs.5 This makes SSDs ideal for working within measure of a drive’s worth. While SSD drives may environments that require lots of data input and have higher costs per gigabyte, a lower cost SSD can analysis (Tier 0 and high Tier 1 databases), as SSDs Revenues outperform from enterprise a higher priced HDDhardwhenand solid-state it comes to disk have (HDD/SSD) drives worldwide from much lower latency. 2012 to 2019 input/output operations per (in million second U.S. (IOPS). SSDs are dollars) additionally more cost effective than HDDs given Even the fastest HDDs, on the other hand, don’t their numerous cost savings, such as reduced power 17,500 function as well in high-traffic environments, as consumption and lower space requirements. they are limited to the physical disk scanning for 15,356 reading and writing. SSDs have no moving parts and 15,000 Furthermore, even though innovations in HDD 13,515 no physical sectors to scan, therefore they are not technology, like heat-assisted magnetic recording, limited by hardware movement. 11,475 12,500 11,036 11,374 11,545 are leading to increases in capacities, the fact that Revenue in million U.S. Dollars 10,420 HDDs utilize mechanical parts by nature give them 9,707 Boot time and transfer rates are faster with SSDs 10,000 8,897 8,917 9,181 an inevitable performance wall. At some point, as well. In situations where time is a factor, such HDDs would need to spin faster to achieve greater 7,463 as when dealing with financial transfers, or when 9,418 7,500 performance, increasing power consumption, performance speed is a matter of life or death, 5,771 decreasing reliability and creating serious such as with an autonomous car reacting to a road 4,429 overheating 5,000 issues. problem, the high-performance speeds of SSDs 3,022 help greatly.6 And2,500 that is not to say that SSDs offer limited capacities today. With innovations like 3D NAND, With additional advances in write-optimization and manufacturers can produce highly dense chips, the viability of compression and deduplication given 0 offering increasing2012 capacities 2013in very small 2014 2015* the fast performance 2016* 2017*of SSDs, 2018* organizations can 2019* packages (2.5” V-NAND SSDs are already available succeed with fewer SSDs than the number of HDDs by Samsung). HDD SDDs, whichSSDhave already dominated they would otherwise require. For those dealing with the under 1 TB consumer market, are growing fast big data, this level of performance and low latency is beyond the growth of HDDs and poised to take over a must. the realm of higher capacities. Consolidating for the Future | 3
SSDs are more to be replaced. SSDs have an AFR that is much lower - 0.5% - making it such that HDD failures in one year reliable can exceed SSD failures over five years.8 One minute of downtime can cost a company an average of $7,900.9 It’s far better to reduce that chance of failure. Additionally, wear-leveling technology optimizes the rewrite process of an SDD to even out usage across data blocks to prolong the lifespan of the drive. It’s like a tire rotation if your SDD were a car. The SDD controller manages data movement so that the P/E cycle is undertaken across multiple cells, rewriting data to new blocks dynamically and utilizing lesser used segments. Data may have an expiration date, but today’s SSDs are prolonging it further into the future. The importance of SSD technology in burgeoning sectors Unlike HDDs, SDDs have no moving parts, meaning While SSDs are already widely used in computers and there are no gears and spindles to degrade and mobile devices, SSD implementation in data centers ultimately break. The mechanics of HDDs themselves will prove crucial to burgeoning sectors dealing with create a bottleneck in high I/O environments.7 big data. Such industries requiring quick processing And because enterprise SDDs are protected against and predictive analysis require high IOPS and low power outages, data is protected against the latency. unexpected. The non-volatility of NAND SSDs, in particular, allow them to retain data even when the Imagine all the thinking that goes into driving a car power source goes missing. (responsibly) on the road; there are many factors to consider. For autonomous cars, the amount of Many who are unfamiliar with current SSD data inputted is even greater. While humans rely technologies may still be concerned with the on primarily visual and auditory cues, autonomous reliability of the drives. The lifespan of an SSD is cars of the future will utilize not only visual data but measured by a finite number of writes and rewrites also, to say the least, real-time metrics on traffic (the P/E cycle). While this may sound ominous, conditions from satellites, and analytics data on it’s not unlike general depreciation of any capital. automobile systems maintenance and the car’s Manufacturer warranties of SSDs today range from many sensors. All this data needs to be processed, three to five years, with some lasting as long as 10 analyzed and conveyed in real time for the safety years. While true longevity of a drive is determined of the passenger(s), ultimately resulting in up to by how it’s utilized in practice, we can see SSDs 1GB per second of metrics for current models of holding their own in terms of lifespan. self-driving cars (that’s 2 petabytes of data per car per year, as things currently stand).10 This number The annual failure rate (AFR) of HDDs is, on average, will rise dramatically in the future. As more sensors 3 percent. In an enterprise setting over a five-year and analytics are added, as well as the ability for period, this means that 15 percent of HDDs will need Consolidating for the Future | 4
cars to communicate with one another and to smart and business IT systems. While SSDs themselves can city installations, data storage and processing encrypt automatically, in these sectors where privacy requirements are compounded greatly. is tantamount, self-encrypting drives (SEDs) provide a stronger solution. For SEDs, Encryption is managed SSDs will play an influential role in the management by the firmware of the SSD, meaning no additional of big data for autonomous cars. Because safety software is required. Power-fail protection prevents is a factor, reliability is an issue. Data processing SSD corruption, while the nature of SSDs themselves requirements will go beyond simple navigational provide greater reliability than HDDs against needs, thus requiring faster I/O speeds to interact environmental factors. SEDs are cost-effective and with a vast network of constant information flow, reliable, providing all the benefits of SSDs with even an unseen world in which cars not only “speak” more security.12 with one another but where they also learn to be better drivers. This back-and-forth of compiling and Smart devices also need to conserve energy as many creating data places a spotlight on the autonomous are always on. SSDs are beneficial in this regard car industry as a contributor to big data, replacing oil because they can be placed on standby and switched as the key “driver” in transportation.11 on quickly. When security, reliability and read/write speeds for large quantities of data are involved, SSDs The Internet of Things are the optimal solution. The Internet of Things (IoT) is another key player in Wireless mesh networking the future of big data. As more smart devices compile information to enhance user experience, more data Wireless mesh networking takes the ideas present will be generated and communicated for analysis in autonomous cars and IoT and expands them to between systems. The same learning data points your create an entire smart city. Imagine streetlamps, future smart espresso machine might utilize to brew road bumps, traffic lights and signs all equipped a perfect caffè latte joins the countless other metrics with nodes (small transmitters) that allow them that include those used by your smart refrigerator to communicate with one another. Using dynamic to identify a lack of almond milk. But with so much routing, nodes know the quickest method of information flying around, one important requirement conveying information directly to other nodes while comes up that needs addressing: security. adjusting to find the clearest signal, bypassing the need for a central hub and eliminating concerns Even smart home devices of today generate concern of signal blockage. Because it’s wireless, mesh over privacy issues. This is even more true with computing provides great applications for many regards to industrial IoT (IIOT), such as with military industries and products of the future.13 Consolidating for the Future | 5
New nodes can be introduced without requiring a disruptive changes that will forever alter the way massive system update, as existing nodes can adapt we do things. All this data, of course, will need to be to new ones. As more nodes are introduced, the processed quickly and reliably. As storage capacities network expands, creating even more data. Where for SSDs increase in smaller chips, prediction once entire cities were illuminated by whale oil, then capabilities utilizing large sums of data increase gas, and now electricity, the cities of the future will dramatically. Technology is quickly catching up, be energized by silent nodes communicating back once again, to what was thought of as merely and forth, processing the data that will illuminate science fiction. our lives. As with autonomous cars and IoT, SSDs will provide a big role in the processing of all this The logical step forward from predictive analytics information because of reliability and low latency. and smart devices is artificial intelligence (AI). To This is especially true with 3D NAND SSDs, as create a machine capable of thinking, learning and citywide smart devices will require smaller drives to planning, it would require a system that can take in fit inside tiny devices. vast quantities of observational data and process it all at incredible speeds. The field of predictive analytics takes on big data analysis on a complex scale. Big data may be the new oil, but oil is useless without proper application. Analytics, from diagnostic to deep learning, is the application of big data. Predictive analytics is taking what is observed and forecasting, in real time, to predict and anticipate future behavior. Your favorite dating app or video recommendation application are innocuous precursors to a much bigger world of machine learning and prediction. Take the previous explorations of autonomous cars, IoT and wireless mesh networking and toss predictive analytics into the mix. One day, all that data, both current and historic, generated by the Storage with high IOPS is a big factor in developing numerous sources will be compiled for utilization. AI systems. With faster processing, the artificial Consumers benefit from real-time predictive analysis mind is all the better. This can be seen today in of such data because it allows companies and gaming while comparing today’s offerings to those organizations to provide greater user experience of the past. Computerized enemies in many games, (imagine a carsharing app knowing where in a particularly first-person shooters, are essentially sprawling city it ought to deploy most of its fleet of rudimentary forms of AI. While in the past such autonomous cars based on past consumer behavior). villains might only move back and forth, today they Subsequently, company’s benefit for anticipating anticipate gamer behavior and act accordingly. This is customer needs (such as when to increase shipping, thanks to increases in computing and storage power. inventory, etc. of an item). As more data comes in, As these two components increase in power, so too the potential for predictive analytics grows in ways does the potential for capable (and non-villainous) AI. hard to fathom today, as the utilization of various seemingly unrelated data points in real time can New designs for SSD chips provide a big leap forward yield behavioral forecasts that step outside the box in supporting AI. Advances that can pack more of today’s conventional thought. storage into smaller packages with lower power requirements, like 3D NAND SSDs, will be essential The application of real-time predictive analytics for becoming the “brains” behind future AI, driving with big data transcends industries and fields. From the exponential growth of SSD capabilities and sales to policing, predictive analytics can provide leading the way to smarter machines. Consolidating for the Future | 6
Where do we go from here? So how does an organization prepare itself for a future swimming in evermore data? Consolidation is the key. Whether your organization is preparing to work in the aforementioned sectors or is just looking to make its data center more effective, consolidation is a practice worth pursuing to increase efficiencies and reduce costs. CONSOLIDATION AND COST SAVINGS Consolidation is the reduction of data center Total Cost of Ownership machines, making the most of the resources at hand. Since 2009, shortly after the financial crisis, In 2016, worldwide data center systems spending Wells Fargo underwent the consolidation process was about $173 billion, projected to increase to $177 via virtualization and closed 100 data centers billion in 2017, 5% of overall IT spending ($3,486 while increasing efficiencies. The carbon footprint billion).(16) Data centers play an integral part in the of their mainframes was cut in half, while energy future of big data. Upgrading HDD servers with SSDs consumption dropped as much as 5% annually.14 can reduce the total cost of ownership (TCO) of the Additionally, the United States government saved data center. TCO is comprised of two costs: capital $3.3 billion in the consolidation of its servers since expenditures (CapEx) and operating expenditures 2011.(15) For organizations that run data centers, (OpEx). whether big or small, consolidation is key for CapEx refers to the costs of obtaining capital. While optimizing performance and lowering costs. SSDs cost more per gigabyte, they are higher in density and perform better than HDDs. Data centers Regardless if it’s about reducing the number of utilizing SSDs require fewer racks than those utilizing machines or making the most of your current data HDDs to obtain the same level of performance. This center space, it goes without saying that SSDs, which can translate to lower CapEx costs for those building offer higher performance per drive than several data centers, because fewer SSDs are required to HDDs, can be an important factor in consolidation. match the performance of more HDDs. For those Other benefits of consolidation include increased consolidating, even more of the savings with SSD reliability, reduced real estate costs, and lower implementation will come from OpEx. maintenance costs related to cooling, labor, and, most notably, power consumption. In this section, OpEx refers to the costs of operating the capital, we’ll explore these savings, and several others, and covering a wide variety of costs from the electricity make the case for consolidation. bill to server maintenance and labor. Nearly 80% of data center costs follow the initial acquisition of the But to truly understand the cost savings of capital.(17) For this reason, making the data center implementing SSDs in the data center, we need to more manageable can yield significant cost savings; look at the Total Cost of Ownership (TCO) of a data it’s in OpEx where the greatest savings can be found center. with consolidation. For example, a small 5 to 10 rack data center with cooling, power, fire suppression and Tier 1 security can cost approximately $4,500 per month in total ongoing maintenance, of which Consolidating for the Future | 7
the highest costs are power ($2,500 per month; 55.6% of total SAMSUNG ongoing costs) and cooling ($750 per month; 16.7%).18 Reducing power consumption alone can make a huge difference and SSDs can help. The Storage Networking Industry Association (SNIA), a non- profit authority on the storage industry, conducted a study with an email server configured for 2,000 active Blackberry email users with 100MB storage per address and 3 IOPS per user. For the requirements, they concluded that 18 SAS 15K 146GB HDDs would be necessary, while only two SATA 256 256GB SSDs could be used otherwise. The sets of HDDs and SSDs were compared over a period of five years. The TCO of the HDDs came out to be $17,063 while the total cost of the SSDs was $6,348. While the upfront cost of the HDDs was $13,177, the SSDs cost $6,273 (47.6% Samsung’s 32TB NAND SSD, set the cost of the HDDs). The margin was even more apparent with for release in 2017, more than subsequent per year costs: the HDDs cost $987 per year while the doubles the storage capacity of SSDs cost only $19 (1.9% the cost of the HDDs) to maintain. The the PM1663a SSD, released in biggest savings came from power consumption and cooling, with 2016, from 15TBs to 32TBs by SSDs costing only 1.9% that of HDDs. To put it simply, fewer SSDs utilizing 64 vertically stacked can do the job of many HDDs at a fraction of the TCO.17 NAND storage layers in one chip package, the highest Because SDDs use less than half the power of HDDs while density currently available delivering hundreds of times higher performance in terms of in an SSD chip. Additionally, IOPS and require less cooling, major OpEx costs can be drastically the NAND SSD fits 2.5-inch reduced with SDD implementation in the data center. slots (as opposed to 3.5-inch slots), facilitating insertion into smaller drive slots. Reducing ongoing costs Performance-wise, the 32TB SSD allows for higher sequential in your data center read/write performance in smaller devices with greater power efficiency. These speeds Data centers use a lot of power. According to the U.S. Department are essential in processing lots of Energy, data centers in the United States used an estimated of data in real time, using a 70 billion kWh, or 1.8% of total domestic electricity consumption, flash array that blurs the line in 2014, the most recently measured year. That’s 2 percent of all between caching and storing. domestic power output.19 Data center energy use is expected to Based on 4th Generation increase 4 percent from 2014-2020, to an estimated 73 billion vertical NAND (V-NAND) kWh in 2020.20 technology, the 32TB SSD will be the largest capacity SSD in Replacing server racks currently running HDDs with SDDs allows the industry. organizations to reduce their data center footprint, or get more out of the space they currently have, since servers use 50 percent of a data center’s power.21 At peak, SSDs use about 2W each, while HDDs use about 6W, making SSDs the obvious choice for energy cost savings.22 Additionally, should data processing requirements grow, it’s far safer to optimize current space than to acquire new space. Consolidating for the Future | 8
Expansion can be risky because construction can cause data outages, annoying customers and Bridging the future damaging company reputation, as well as create security concerns should the servers contain with 3D NAND sensitive data. As such, data center energy expenditures, one of the top OpEx costs, could stand to be lowered. The practice of consolidation is one promoted also by the United States government in the hopes of reducing data center sizes and energy usage. The financial gains for a company consolidating its data centers are such that a 20 percent reduction in electricity usage could result in savings of approximately $90 per server per year on the energy bill, as well as reduced capital costs for the facility by $2,000 per server.23 Increasing efficiencies with dense, high- We’ve seen in this white paper how SSDs can performance SSD drives can offer more revenue from outperform HDDs in performance, reliability and cost higher transactional speeds with a reduced operating savings. But SSDs have been improving far faster space. than HDDs as they stand as a technology now. It’s in these innovations that SSDs will far surpass HDDs. Cutting power consumption creates a more social benefit as well. In 2013, data centers in the United It isn’t until we explore the potential of 3D NAND States produced approximately 97 million metric that we see the true future of SSDs. There’s only so tons of CO2. In 2020, it’s predicted that data centers much one can fit on a 15nm transistor; 3D NAND will produce 147 million metric tons.24 With so builds upwards in the same amount of space. much work going to offsetting carbon footprints, With stacked layers of memory cells, 3D NAND consolidation can save a company from any related scales 2D (planar) NAND to greater capacities and taxes and fees. Such efficiency improvements are performance, with increased densities that result in expected to save 620 billion kWh between 2010 lower costs per bit. Vertical architecture allows up to and 2020.20 100 layers of cells to be stacked with potentials to scale densities up to 1 terabit as we forecast today. Another benefit of utilizing fewer servers upgraded with high-performance SSDs is that there are 3D NAND offers benefits beyond vertical stacking. reduced touch points for management. Cutting the Advances in algorithms additionally speed data number of servers and data center space means writing to up to two times faster than traditional there’s less of a management burden on hired SSDs while utilizing 45 percent less energy. Non- labor. Additionally, SSDs create less noise, so the conductive insulation holds charges to prevent data noise pollution within the data center is reduced as corruption, thereby increasing reliability beyond the well, something the management staff would likely traditional SSD. appreciate. In the near future, SSDs will overtake HDDs in Again, much of the TCO of a data center falls per gigabyte cost savings as it already has in under OpEx costs. High-density SSDs lead the performance, and it will be 3D NAND leading way in allowing for data center consolidation to the way. substantially reduce such expenditures. Consolidating for the Future | 9
SUMMARY It seems like a long time now since the days of the early oil pioneers. But as Rockefeller capitalized on fields of oil, so too will today’s organizations benefit from fields of data if they are prepared to do so. Innovations like 3D NAND will optimize data centers efficiently and reliably, allowing organizations to consolidate with SSDs, generating more processing power within comparatively smaller spaces to data centers operating with HDDs. With the cost of SSDs decreasing alongside increases in storage capabilities, the future for companies operating in a world of big data looks mighty good. References 1 http://www.informationweek.com/big-data/big-data-analytics/big-data-analytics-sales-will-reach-$187-billion-by-2019/d/d- id/1325631 2 https://www.idc.com/getdoc.jsp?containerId=US40422516 3 http://www.anandtech.com/show/10348/q1-2016-market-trends-ssds 4 http://www.itproportal.com/2014/04/15/real-time-storage-tiering-for-real-world-workloads-the-ultimate-guide/ 5 http://www.networkcomputing.com/storage/ssd-pricing-vs-hdd-costs/851436865 6 http://www.ingrammicroadvisor.com/components/an-ssd-comparison-to-hdd-why-solid-state-drives-are-better-for-business 7 http://www.enterprisestorageforum.com/storage-hardware/ssd-vs.-hdd-performance-and-reliability-1.html 8 http://www.slideshare.net/SamsungBusinessUSA/ssds-in-the-data-center-the-total-cost-of-ownership-advantage 9 http://www.qtsdatacenters.com/resources/blog/2016/04/07/a-variety-of-factors-affect-data-center-cost 10 https://datafloq.com/read/self-driving-cars-create-2-petabytes-data-annually/172 11 https://datafloq.com/read/how-autonomous-cars-will-make-big-data-even-bigger/1795 12 http://www.embedded.com/electronics-blogs/say-what-/4442363/In-SSDs-we-trust--but-they-need-to-earn-it- 13 http://computer.howstuffworks.com/how-wireless-mesh-networks-work.htm 14 http://searchdatacenter.techtarget.com/news/4500243727/Wells-Fargo-continues-massive-data-center-consolidation 15 http://www.teamquest.com/en/news/blog/2016/01/data-center-consolidation-benefits/ 16 http://www.gartner.com/newsroom/id/3482917 17 http://www.snia.org/sites/default/files/SNIA_TCOCALC_Workpaper_Final.pdf 18 https://ongoingoperations.com/data-center-pricing-credit-unions/ 19 http://www.snia.org/sites/default/files/SNIA_TCOCALC_Workpaper_Final.pdf 20 https://eta.lbl.gov/sites/all/files/publications/lbnl-1005775_v2.pdf 21 http://datacenterfrontier.com/reduce-power-cost-and-tco/ 22 http://www.ingrammicroadvisor.com/components/an-ssd-comparison-to-hdd-why-solid-state-drives-are-better-for-business 23 http://www.premiersolutionsco.com/wp-content/uploads/2010/12/Total-Cost-Of-Ownership-For-Data-Centers.pdf 24 Computerworld; Natural Resources Defense Council Consolidating for the Future | 10
You can also read