First Line Software is a premier provider of software engineering, software enablement, and digital transformation services. Headquartered in Cambridge, Massachusetts, the global staff of 450 technical experts serve clients across North America, Europe, Asia, and Australia.
Content migration can sometimes be tricky and even downright messy (with good reason) . In fact, large scale content migration is often the most difficult aspect of implementing a new CMS. Having worked on some challenging migration projects for many of our clients, we’d like to share a few practical tips on how to make life a little easier for teams involved with particularly large and demanding CMS migrations.
First off, content migration should be executed as early as possible and presented to editors in the test environment. For large sites with thousands of pages, a 100% verification of their look and feel is hardly possible. However, starting the review process earlier helps with broadening test coverage and with identifying fresh migration-generated/CSS-related issues sooner.
Also, in order to migrate a large website with constantly generated and updated content, you may want to make your rollout sequence look something like this:
- Initial rollout of the configured CMS + full migration of all content created up-to-date into the staging environment
- Extensive testing + manual site modification by editors. Certain pieces of content typically have to be migrated manually (side blocks, teasers, home pages, etc.)
- Bugfixing release + delta migration for the content created since the previous migration (#2 and #3 can be iterated as many times as necessary to reach release candidate status)
- DNS switching to new CMS (up until that point you are double-publishing the content to both old and new site while “freezing” user-generated content)
Let’s also touch on the issue of the old URL structure. Most large sites today have a wide variety of content: html, RSS, images, PDF files, video, etc. When moving to a new CMS, in most cases it is simply too expensive to preserve the URL structure 100% , even if the new CMS allows for that. A less costly and a more practical avenue to take might be identifying the types of content for which URL mapping can be preserved automatically, as well as the exceptional cases where the URLs can be fixed manually:
- Analyze your inbound link structure
- Identify the top URLs where your visitors are landing: search engines, referrals, bookmarks, etc.
- Identify subscription statistics
- Obviously, Google Analytics and various other webmaster tools can help here.
And another tip: If you have a team that is skinning/customizing the new CMS with off-the-shelf plugins or even writing their own code, please keep in mind that they may not always be diligent about documenting their changes in database schemas. To mitigate that, migration engineers should be part of this team from the very beginning to facilitate “osmotic communication”.