We use cookies to give you the best online experience. By using our website you agree to our use of cookies in accordance with our cookie policy.

croom new

Preferred Blog

Preferred has been serving the Tinley Park area since 1991, providing IT Support such as technical helpdesk support, computer support, and consulting to small and medium-sized businesses.

Meet Google Spanner: The Largest, Single Database Ever Created

The evolution of Google from a search engine company renting a garage, to one of the largest companies on the planet, is a story that can truly boggle the mind. One cause of Google's growth is their investment in developing new technology. This is the case with the Google Spanner, which is a new technology designed to manage Google's growth.

At the heart of the Google empire is data. Google houses all of their data from all of their various ventures (YouTube, Gmail, Google Docs., etc.), in giant warehouses all around the world called data centers. Each data center is packed full of servers that stores the world's data on thousands of hard drives. Google's showcase data center, in their hometown of Mountain View California, has an estimated 45,000 servers, and this is just one of perhaps a dozen or more data centers around the world.

Google has hundreds of thousands of servers operating all around the globe, Datacenterknowledge.com estimates Google operates 900,000 servers based on power consumption reports released by Google. Herein lies the challenge, almost a million servers housed in data centers around the globe, serving the information needs of people in almost every country, and all of this data needs to be synced across the world's 24 different time zones.

The Google Spanner is a massive database that is designed to efficiently scale this complex flow of data to a single time. Spanner will timestamp all of Google's data from all around the world using True Time API, which uses an atomic clock combined with a GPS clock in order to sync every server in every Google data center. Google released a very detailed 14 page paper explaining how Spanner works; here is an excerpt with a technical explanation (you might want to put on your thinking hat for this one):

Data is stored in schematized semi-relational tables; data is versioned, and each version is automatically timestamped with its commit time; old versions of data are subject to configurable garbage-collection policies; and applications can read data at old timestamps. Spanner supports general-purpose transactions, and provides a SQL-based query language.

And breathe. In order to achieve this colossal data syncing feat, every one of Google's 900,000+ servers will need to be fitted with a GPS antenna and an atomic clock. Therefore, it might be a good move to find out who manufactures these devices and purchase stock in that company. The Google Spanner is incredibly efficient, able to automatically balance the demands for data across all of Google's servers. Spanner also uses a data structure that lets the applications control the data locality, this means the data retrieval time for users will be at a maximum.

There are enough technical details on how the Spanner works to overheat your graphing calculator. One last detail that we would like to point out is the sheer scope of the Spanner's processing power. Google has designed the Spanner to handle up to 10 million servers! The Spanner is Google's effort to lay the groundwork for future company growth, ensuring they will be able to grow ten times their current size. Considering how far Google has come from a few servers crammed into a garage in 1998, to their current incarnation, it should not be too long before Google reaches 10 million servers and has to invent yet another technology to keep up with its growth.

While you may not be facing the same ginormous server challenges as Google, you may have servers in your office that are underutilized. Just like the Google Spanner is an attempt by Google to maximize efficiency from all of its servers, so too can Preferred evaluate your server infrastructure and determine best practices that can reduce power consumption. Through a process called virtualization, we can even reduce by at least 50% the number of server units that you are using. Call Preferred at 708-781-7110 and let us show you how you can get the most out of your technology.

Is a Mobile and Cloud-Based IT Infrastructure Righ...
Map Out Your Network’s Growing Pains
 

Comments

No comments made yet. Be the first to submit a comment
Guest
Already Registered? Login Here
Guest
Thursday, December 19 2024

Captcha Image

Sign Up for Our Newsletter

  • First Name *
  • Last Name *

      Latest Blog Entry

      Many of today’s businesses span continents. That’s not an exaggeration. With faster Internet speeds fueling remote workforces, some businesses choose to strategically use workers from all over the globe to create and sell their products or services. In today’s blog, we’ll di...

      Latest News

      Best Places to Work - 5 YEARS STRONG!!

      Preferred is once again, honored for being a Best Place to Work for the fifth straight year!  Our team is what makes Preferred a Best Place to Work.  

      Daily Herald Suburban Business 2024 Best Places to Work Honorees The Daily Herald Suburban Business has announced the names of 51 companies, in 5 categories of competition, that are honored as the 2024 Best Places to Work in Illinois. This statewide survey and awards program was designed to identify, recognize and honor the best places of employment in Illinois, benefiting the state's economy, its workforce and businesses.