Presentation: HUGE! A Drupal site with 381 modules, 174GB MySQL database, and 200 million row tables

Published Mon, 2012/11/19 - 10:18, Updated Thu, 2013/02/21 - 14:03

Here are the slides from my Drupal Camp Toronto 2012 session.

The site is ScienceScape.org, a repository of scientific research going back to the 19th century, down to the latest biotechnology and cancer release.

Update: You can watch a video of the presentation on Vimeo.

AttachmentSize
drupal-site-with-huge-database-and-lots-of-modules.pdf535.43 KB

Can we hear/see this in audio/video format?

Amzed with the presentation! Just wondering if we can have access to audio/video of the session?

Maybe later

Audio was recorded along with slides. However, it has not been posted yet. I will update the info above to add a link when I get it.

Hey, posted this to Reddit.

Hey, posted this to Reddit. Sorry if it seemed from the title that I was horrified by it- actually I thought the presentation and what you did was really cool and I was more aghast at the database size.

Great work! Thanks for posting it!

No worries

No worries. Glad it was clarified, and at least some people got something out of it. That is what it is all about: knowledge sharing within the community.

Great presentation with good

Great presentation with good ideas.

Due to some aggressive feature-usage on one of our sites, this script would be really interesting!

Great Slide Deck

Nice solution for the database import and reducing the number of installed modules by creating a mega feature.

Did you come up with a solution for PHP network calls in regards to Sphinx?

Working on it

My understanding is that their staff are working on reducing the number of calls to Sphinx.

You can check here for: Mydumper for fast MySQL backups.

Drupal 6 or 7 site?

Just curious, is this a Drupal 6 or 7 site? My guess is D6 because you list issues with module_hooks & module_implements. If it is D6 http://groups.drupal.org/node/187209 has a list of core patches "Big Performance Gains - Low Risk" that really help with these.

In regards to network traffic, if it is over http http://drupal.org/project/httprl/ is a trick that I use for speeding up parallel http requests. Also, if you have to do something that takes a long time using multiple processes to do it can help: http://groups.drupal.org/node/226054

Drupal 6

The site is Pressflow 6.

Is your Drupal or WordPress site slow?
Is it suffering from server resources shortages?
Is it experiencing outages?
Contact us for Drupal and WordPress Performance Optimization and Tuning Consulting