Greenpeace: Cloud, Consumption & Climate Change

Greenpeace has now turned its attention to cloud computing, its impact on energy consumption, and the resulting effect on the environment. A somewhat loosely compiled report takes a shot at the cloud initiatives of consumer tech majors such as Google, Apple, Microsoft, and Yahoo, and arrives at a broad set of concerns that need to be addressed.

The Greenpeace chart details the power needs for some of the major data center projects. 


The report focuses on the role data center builders can play in environmentally sensitive design and in the investments that are badly needed. It envisages data center builders becoming an integral part of the solution to climate change and emphasizes the positive impact these players can have on the tech ecosystem and, by extension, on the wider world. Greenpeace-commissioned studies on the carbon footprint of data centers show that there is plenty of room for operators to use clean and renewable energy while building cloud infrastructure for their customers. The key approach recommended is for data center operators to press governments, local bodies, and energy suppliers such as utilities to make renewable energy readily available.

The metrics show the enormity of the task at hand. Greenpeace believes that public awareness is the best ammunition in the pursuit of any cause and, consistent with its focus on advancing its perspective through news coverage, timed its report to this week’s launch of the Apple iPad. Apple’s investment in a North Carolina data center is tied to the cheap power available there in large quantities from coal-fired plants.

In its report, titled Cloud Computing and its Contribution to Climate Change, Greenpeace estimates that data center power consumption could peak at a million megawatt hours by 2020 and that its sister industry, the telecom sector, could be using around 950,000 megawatt hours. The reality is that over the last decade or so computing power has proliferated massively in the form of desktops, laptops, smartphones, iPods, tablets, e-readers, and gaming devices. These devices not only consume power themselves but are also heavily connected to cloud grids, which consume power in large amounts too. Greenpeace bases its concern mostly on the 2008 analysis by the Climate Group and the Global e-Sustainability Initiative, and finds that the cloud’s energy consumption could grow massively (almost triple) in the next ten years. One of the projections in the report is that data centers would produce more carbon than the airline industry by 2020 “if current trends go unchecked.”
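To put the "almost triple in ten years" projection in perspective, it helps to convert it into a yearly rate. The short Python sketch below assumes constant compound growth, which is a simplification of mine and not something the report states; the function name is likewise just illustrative.

```python
def implied_annual_growth(multiple: float, years: int) -> float:
    """Constant yearly growth rate that compounds to `multiple` over `years`."""
    if multiple <= 0 or years <= 0:
        raise ValueError("multiple and years must be positive")
    return multiple ** (1.0 / years) - 1.0


# Assumption: "almost triple" is rounded up to exactly 3x over 10 years.
rate = implied_annual_growth(3.0, 10)
print(f"Implied growth: {rate:.1%} per year")
```

Under that assumption, a tripling over a decade works out to growth of roughly 11.6% a year, every year.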
The cloud energy phenomenon, as the report recognizes, is going to be something of a paradox to manage. On the consumption side, energy-conscious tech majors such as Google, which consume a great deal of power, have a strong incentive to make power consumption more efficient and are actively working to advance the field. The hope is that the best practices and technology they develop will in time spread across the tech ecosystem. While this can happen, the fear is that the growing number of connected devices around the world could massively push up energy consumption in aggregate. All of us have more connected devices at home than we had five or more years ago, so this explosion should rightly worry the likes of Greenpeace.

The report, part of Greenpeace’s Cool IT campaign, builds on the group’s criticism of Facebook for building its new Oregon data center in an area where utility power comes primarily from coal. Incidentally, there is a Facebook group urging Facebook to use only clean energy, but it has not been able to move the needle even a bit. So much for Facebook activism showing real results! The important point here is that, for all the criticism of IT energy usage, the sector is an important lighthouse for improved sustainability. “While the sector plans to significantly step up the energy efficiency of its products and services, ICT’s largest influence will be by enabling energy efficiencies in other sectors, an opportunity that could deliver carbon savings five times larger than the total emissions from the entire ICT sector in 2020,” points out the 2020 report.

Greenpeace writes: “The industry also needs to take responsibility for where it gets its energy from in the first place. Simply put: Will the cloud run on coal or renewable energy?” “We are calling on IT industry giants to put their might behind government policies that give priority grid access for renewable sources like wind and solar energy,” Greenpeace said. “IT companies should also support economy-wide climate and energy policies around the world that peak climate emissions by 2015. … The great innovators of the digital age can and should be leaders in promoting an energy revolution.”
In reality, as things stand today, clean and renewable energy is unfortunately not the only criterion for choosing a data center location; factors like local incentives, benefits, and energy grid pricing mechanisms all play important roles in the decision. Greenpeace is right to highlight that clean energy needs to be pushed to the center of data center decision making, but it recognizes that data centers cannot act on this entirely by themselves and calls for mobilizing public and political opinion in support. While Greenpeace and other activists could focus on cloud service providers, there is a larger role for the public to play. Many things are expected to change in the next few years. A spurt of common green initiatives is expected, including e-documents, teleworking, and reduced travel. Technology can also provide analytic tools that businesses can use to monitor and reduce energy consumption in real time in areas such as transportation and other carbon management activities. Without optimum use of energy in the day-to-day lives of the millions of us actively using the cloud, the laudable goal of clean and renewable energy will always remain a dream.

New Tungsten Software Releases for MySQL and PostgreSQL

I would like to announce a couple of new Tungsten versions available for your database clustering enjoyment.  As most readers of this blog are aware, Tungsten allows users to create highly available data services that include replicated copies, distributed management, and application connectivity using unaltered open source databases.   We are continually improving the software and have a raft of new features coming out this year.  

First, there is a new Tungsten 1.2.3 maintenance release available in both commercial and open source editions.  You can get access to the commercial version on the Continuent website, while the open source version is available on SourceForge.

The Tungsten 1.2.3 release focuses on improvements for MySQL users including the following:
  • Transparent session consistency for multi-tenant applications.  This allows applications that follow some simple conventions like sharding tenant data by database to get automatic read scaling to slaves without making code changes.
  • A greatly improved script for purging history on Tungsten Replicator. 
  • Fixes to binlog extraction to handle enum and set data types correctly. 
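To make the session consistency idea in the first bullet more concrete, here is a minimal Python sketch of read routing when each tenant lives in its own database. It is only an illustration of the general read-your-writes technique under my own assumptions, not Tungsten's actual implementation; every class, method, and field name in it is hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """A slave copy that tracks the last applied sequence number per database."""
    name: str
    applied: dict = field(default_factory=dict)  # database -> applied seqno


@dataclass
class SessionRouter:
    """Hypothetical router: a read goes to a slave only if it has caught up."""
    master_name: str
    replicas: list
    # (session id, database) -> sequence number of that session's last write
    last_write: dict = field(default_factory=dict)

    def record_write(self, session: str, database: str, seqno: int) -> None:
        self.last_write[(session, database)] = seqno

    def route_read(self, session: str, database: str) -> str:
        needed = self.last_write.get((session, database), 0)
        for replica in self.replicas:
            if replica.applied.get(database, 0) >= needed:
                return replica.name
        return self.master_name  # no slave is fresh enough yet


slave = Replica("slave1", applied={"tenant_a": 40, "tenant_b": 17})
router = SessionRouter("master", [slave])
router.record_write("s1", "tenant_a", 42)

print(router.route_read("s1", "tenant_a"))  # slave is behind this session's write
print(router.route_read("s1", "tenant_b"))  # this session wrote nothing to tenant_b
slave.applied["tenant_a"] = 42
print(router.route_read("s1", "tenant_a"))  # slave has caught up
```

Because freshness is tracked per database, a write to one tenant's database does not force reads for other tenants back to the master, which is where the read scaling comes from in this sketch.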
By far the biggest improvement in this release is Tungsten product documentation, including major rewrites for the guides covering management and connectivity.  Even the Release Notes are better.  If you want to find out how Tungsten works, start with the new Tungsten Concepts and Administration Guide.

Second, there's a new Tungsten 1.3 release coming out soon.  Commercial versions are already in use at selected customer sites, and you can build the open source version by downloading code from SVN on SourceForge.

The Tungsten 1.3 release sports major feature additions in the following areas: 
  • A new replicator architecture that allows you to manage non-Tungsten replication and also to configure very flexible replication flows to use multi-core systems more effectively and implement complex replication topologies.  The core processing loop for replication can now cycle through 700,000 events per second on my laptop--it's really quick. 
  • Much improved support for PostgreSQL warm standby clustering as well as provisional management of new PostgreSQL 9 features like streaming replication and hot standby.  
  • Replication support for just about everything in the MySQL binlog:  large transactions, unsigned characters, session variables, various permutations of character sets and binary data, and ability to download binlog files through the MySQL client protocol.  If you can put it in the binlog we can replicate it.  
We also have provisional support for Drizzle thanks to Markus Ericsson, plus a raft of other improvements.  This has been a huge amount of work all around, so I hope you'll enjoy the results.

P.S. Contact Continuent if you want to be a beta test site for Tungsten 1.3.

Replication and More Replication at 2010 MySQL Conference

Database replication is still interesting after all these years.  Two of my talks focused on replication technology were accepted for the upcoming MySQL 2010 Conference.  Here are the summaries.
The first talk is a solo presentation covering Tungsten, which creates highly available and scalable database clusters using vanilla MySQL databases linked by flexible replication.  I'll describe how it works and some cool things you can do like zero-downtime upgrades and session-based performance scaling.   If you want to know how Tungsten can help you, this is a good time to find out.

The second talk is a joint effort with Jay Pipes covering issues like big data that are driving replication technology, along with the solutions to these problems available to MySQL users.  We'll lay out our vision of where things are going to help you pick the right technology for your next project.   Jay and I are also soliciting input on this talk from the Drizzle community among others.  If you are interested, check out the thread on drizzle-discuss or post to this blog.

Finally, I'll be around for much of the MySQL conference, so if you are interested in Tungsten or data replication in general or just want to hang out, please look me up.   See you in Santa Clara!

Tungsten and PostgreSQL 9 at PG-East Conference

My Continuent colleagues Linas Virbalas and Alex Alexander will be giving a talk entitled Building Tungsten Clusters with PostgreSQL Hot Standby and Streaming Replication later this week at the PG-East Conference in Philadelphia.   I saw the demo last week and it's quite impressive.  You can flip the master and slaves for maintenance, open slaves for reads, fail over automatically, etc.  It's definitely worth attending if you are in Philly this week.

Looking beyond the conference, we plan to be ready to support Tungsten clusters on PostgreSQL 9 as soon as it goes into production.   Everything we have seen so far indicates that the new log streaming and hot standby features are going to be real hits.  They not only help applications; from a clustering perspective, queryable slaves with minimal replication lag are also a lot easier to manage.  Alex and Linas will have more to say about that during their presentation.

Meanwhile, I'm sorry to miss the PG-East conference but wish everyone who will be attending a great time.  See you later this year at PG-West!