Adotas

Where media buyers start online


Featured Author

Gary Orenstein is vice president of technical solutions at MaxiScale. He has served in leadership marketing roles at numerous networking and storage companies. Prior to MaxiScale, he was the vice president of marketing and business development at Gear6, a caching and file acceleration pioneer. He also served as vice president of marketing at Compellent, and was a co-founder at Nishan Systems, acquired by McDATA/Brocade. Mr. Orenstein is the author of "IP Storage Networking: Straight to the Core." He holds an MBA from the Wharton School at the University of Pennsylvania, and a BA from Dartmouth College.

More articles by Gary Orenstein






Features

File Service You Can Count On

Written on
Dec 7, 2009 
Author
Gary Orenstein  |
Share
File Service You Can Count On

service_small.jpgADOTAS – Serving online advertisements places unique burdens on data center storage infrastructure. Online ad networks serve billions of small files, including text, images and video clips, while simultaneously tracking every click to fine-tune ad relevancy and ensure proper billing for ads served. Ad-serving applications must be always online to operate 24 hours per day -– downtime means lost revenue because ad networks are paid by the number of ads placed.

A leading mobile advertising network recently reported over 110 billion ad impressions served in three years. Statistics like this showcase the critical nature of storage infrastructure and the need to move beyond existing systems which cannot keep pace with traffic increases. Ad serving networks can follow basic guidelines to eliminate infrastructure bottlenecks, ensure proper click-through tracking and billing, and easily handle rapid expansion.

Unique File Serving Requirements

The direct linkage between revenue, serving small files and capturing voluminous log files poses challenges and opportunities for ad networks. Companies who maximize their ad content serving capabilities and effectively tune ad targeting schemes based on usage logs stand to capture market share from competitors and increase profitability. Ad networks handle millions to billions of ad requests, most of which are relatively small files, leading to intense pressure on file serving infrastructure. Storage systems must address key requirements to keep up with the workload.

Fast response time: Ad networks must serve ads within milliseconds upon request to meet service level agreements and bill their customers. Typically, ad networks commit to a service level agreement that ads will be delivered in a set period of time. If they cannot serve an ad within that window, they lose the opportunity to generate impressions and click-throughs, reducing revenue.

Large content libraries: Ad networks deliver unique ads or combinations of ads to each user, requiring systems to support random access across a large library of ad content. Predictable response times to randomly accessed content must decrease or remain constant as the amount of ad content scales. Fast ad serving directly from disk is critical because random access patterns to a large data set are not easily cacheable.

Easily add capacity for peak demand: Ad serving systems must scale to support peak loads. Preparing for seasonal traffic should not require months of planning, but rather enable administrators to incrementally add resources that integrate seamlessly and deliver a proportional increase in overall performance.

Transaction logging: Logging workloads are very different in comparison to small file serving. Instead of quickly retrieving ad content, ad networks drive log updates into the file serving infrastructure. Large ad networks rely on logging for ad relevancy and accurate billing. Log data comes to the system from up to hundreds or thousands of Web servers simultaneously. It is critical to handle hundreds of updates per second, often as atomic appends to prevent collisions between log writers.

High availability: Ad networks must deploy highly available infrastructure for 24/7 operation in support of partner websites and to prevent lost revenue opportunities. Storage systems that support ad serving applications must be impervious to individual disk failure or node failure. File systems must maintain ad serving capabilities throughout such disruptions.

Traditional Storage Falls Behind

Traditional storage systems, including those designed just a few years ago, did not anticipate the unique requirements of Internet-scale applications and verticals such as ad serving. Legacy file systems cannot effectively or economically meet these workload demands, and fall short in several categories.

Costly, proprietary hardware: Traditional systems often rely on custom-built, expensive hardware that has a short lifespan and high ongoing maintenance costs. Upgrades are typically full-forklift leading to premature spending.

Slow throughput at scale: Storage hardware vendors typically build larger “boxes” in a scale-up approach and place too many disk drives behind a single controller, causing performance bottlenecks that cannot economically serve many simultaneous requests. Scale-out limitations often force administrators to deploy entirely new islands of storage once the capacity of existing systems is reached. This leads to excessive spending and delays in provisioning and deploying storage.

Not built for web-scale applications: Storage designers just a few years ago could not have anticipated our present era of billions of file operations in Web-scale applications. Consequently, traditional storage systems do not efficiently handle many small files requests, they do not support effortless logging, and they do not include the integrated abilities of a key-value or object store. Ad networks can take advantage of all these functions to consolidate systems, improve reliability and reduce overall costs.

Distributed Systems Meet Ad Serving Demands

Ad serving requires a new approach to storage due to its scale and workload. Distributed systems enable ad networks to build more robust and less expensive file serving infrastructure in several ways.

Inexpensive, commodity hardware: Distributed systems can employ smart software and take advantage of standard, off-the-shelf hardware to achieve the lowest overall costs. Administrators can cut capital costs and maintain lower operating costs by avoiding proprietary hardware systems.

Consistent, low-latency ad content serving: The scale out approach balances CPU power, network bandwidth, memory and storage capacity, enabling a distributed system to consistently serve ads quickly regardless of how large the system grows. Traditional systems fail to match I/O throughput in proportion to capacity growth.

Single namespace at web-scale: Distributed systems provide a single namespace at scale, dramatically simplifying system management and ensuring that application workloads can grow without requiring fork-lift upgrades of the file serving and storage infrastructure. Traditional systems reach maximum capacity relatively quickly in Web-scale applications, leading to multiple storage islands and increased management burden.

Optimize for web workloads: A storage system optimized for Web workloads drives system consolidation, simplifies management, and yields a more cost-effective approach. Ad networks employ a combination of Web workloads including small file serving, logging, and analytics. Traditional systems are not optimized for these workloads, often resulting in costly duplication of systems and processes.

Online ad networks face challenging file serving throughput and capacity requirements. Traditional storage systems have been unable to keep pace with industry growth at price points and degrees of manageability necessary to run profitable businesses. New, innovative solutions built around distributed systems are ready to step in as Web traffic continues to grow.





Reader Comments.

No comments yet

Leave a Comment

Add a comment

Tags: and
Article Sponsor

More Features



  • Is Apple's Ping a threat to other social networks?
    Loading ... Loading ...

Polls Archive

Latest News

News Archive

Spotlight

Spotlight: Vulcan Spring Testifies to GlobalSpec’s ProwessADOTAS – With potential customers stretched across a wealth of industries, Vulcan Spring sought GlobalSpec’s expertise in reaching engineers and [...] more...


Adotas Partnership