© 2008 Stephan M Spencer Netconcepts Unraveling URLs and Demystifying Domains presented by Stephan Spencer,

Slides:



Advertisements
Similar presentations
The Internet.
Advertisements

PHP Meetup - SEO 2/12/2009. Where to Focus? Ensuring the findability of content Ensuring content is well understood by search engines Maximizing the importance.
Performing a Technical SEO Audit. Audit SEO - plan de actiune Overview Gather Data Analyze Present Results.
SEO Best Practices with Web Content Management Brent Arrington, Services Developer, Hannon Hill Morgan Griffith, Marketing Director, Hannon Hill 2009 Cascade.
© 2008 Stephan M Spencer Netconcepts SEO Mistakes Most Bloggers Make By Stephan Spencer, Founder & President,
Marketer to Developer Translation SMX West February 9, 2009.
What is WEB SPAM Many slides from a lecture by Marc Najork, Microsoft: “Detecting Spam Web Pages”
The process of increasing the amount of visitors to a website by ranking high in the search results of a search engine.
Tutorial 14 Working with Forms and Regular Expressions.
WEB SCIENCE: SEARCHING THE WEB. Basic Terms Search engine Software that finds information on the Internet or World Wide Web Web crawler An automated program.
SEO Techniques Tech Talk 29 th August 2013 (By PEN Vannak)
IDK0040 Võrgurakendused I Building a site: Publicising Deniss Kumlander.
Todd Friesen April, 2007 SEO Workshop Web 2.0 Expo San Francisco.
Meta Tags What are Meta Tags And How Are They Best Used?
On-Site Strategies for Optimizing Your Local Business. sunclouddesign.com/talks.
© 2009 Stephan M Spencer Netconcepts 301 Redirect: How Do I Love You, Let Me Count the Ways presented by Stephan.
© 2008 Stephan M Spencer Netconcepts Bot Herding presented by Stephan Spencer, Founder & President, Netconcepts.
Blog Monetization: Soup to Nuts Stephan Spencer, Founder & President, Netconcepts.
Tutorial 14 Working with Forms and Regular Expressions.
Wasim Rangoonwala ID# CS-460 Computer Security “Privacy is the claim of individuals, groups or institutions to determine for themselves when,
SIMPLE ROUTER The slide made by Salim Malakouti. Next we will create the Router  What do I we mean by a router?  Routers work similar to a map. It receives.
© 2005 Stephan M Spencer Netconcepts RSS, Blogs and Search Marketing: Leveraging the Power of RSS.
© 2009 Stephan M Spencer Netconcepts Tools, Glorious Tools presented by Stephan Spencer, Founder & President,
JavaScript, Fourth Edition
© 2008 CrawlWall.com Competitive Counter-Intelligence Stop Snooping Competitors Techniques for protecting your SEO investment from prying competitive eyes.
© 2006 Stephan M Spencer Netconcepts Site Architecture and Internal Linking By Stephan Spencer, Founder &
© 2006 Stephan M Spencer Netconcepts Search Engine Marketing by Stephan Spencer President, Netconcepts.
Python CGI programming
Set 13: Web Servers (configuration and security) (Chapter 21) IT452 Advanced Web and Internet Systems.
Introduction Use of makefiles to manage the build process Declarative, imperative and relational rules Environment variables, phony targets, automatic.
© 2007 Stephan M Spencer Netconcepts Web Site Monetization Make Money While You Sleep.
Apache and... Virtual Hosts ---- aliases mod_rewrite ---- htaccess AFNOG 11 Kigali, Rwanda May 2010 Dorcas Muthoni Courtesy: Hervey Allen.
Chapter 8 Cookies And Security JavaScript, Third Edition.
Installing and Using MySQL and phpMyAdmin. Last Time... Installing Apache server Installing PHP Running basic PHP scripts on the server Not necessary.
SEO techniques & Mastering Google Adwords By Ganesh.S
Regular Expression (continue) and Cookies. Quick Review What letter values would be included for the following variable, which will be used for validation.
1 Search Engine Optimization An introduction to optimizing your web site for best possible search engine results.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
Continuing Education UCC Fall 2010 Search Engine Optimization.
Variables and ConstantstMyn1 Variables and Constants PHP stands for: ”PHP: Hypertext Preprocessor”, and it is a server-side programming language. Special.
_______________________________________________________________________________________________________________ PHP Bible, 2 nd Edition1  Wiley and the.
استاد : مهندس حسین پور ارائه دهنده : احسان جوانمرد Google Architecture.
Dynamic Sites for Fun and Profit Jake Baillie SES Chicago 2006.
Working with Forms and Regular Expressions Validating a Web Form with JavaScript.
Search Engines By: Faruq Hasan.
ASP. What is ASP? ASP stands for Active Server Pages ASP is a Microsoft Technology ASP is a program that runs inside IIS IIS stands for Internet Information.
SEO Friendly Website Building a visually stunning website is not enough to ensure any success for your online presence.
Ten Tips for Search Engine Marketing Stephan Spencer President, Netconcepts
Validation using Regular Expressions. Regular Expression Instead of asking if user input has some particular value, sometimes you want to know if it follows.
Fundamentals of Web DevelopmentRandy Connolly and Ricardo HoarFundamentals of Web DevelopmentRandy Connolly and Ricardo Hoar Fundamentals of Web DevelopmentRandy.
© 2007 Stephan M Spencer Netconcepts SEO Best Practices for Bloggers.
© 2009 Stephan M Spencer Netconcepts Duplicate Content & The Canonical Tag By Stephan Spencer, President &
Website Design, Development and Maintenance ONLY TAKE DOWN NOTES ON INDICATED SLIDES.
© 2007 Stephan M Spencer Netconcepts Making Blogging and RSS Pay Off Driving Traffic and Sales Through Optimized.
 SEO Terms A few additional terms Search site: This Web site lets you search through some kind of index or directory of Web sites, or perhaps both an.
Successful Site Architecture Matt Bailey SiteLogic
Created By EZ Marketing Tech 1 +1 (347) | |
Google Analytics Graham Triggs Head of Repository Systems, Symplectic.
Technical SEO tips for Web Developers Richa Bhatia Singsys Pte. Ltd.
Building Your Own Website Using:. Install & configure LAMP. Download WordPress and run it as a local website on your Raspberry Pi. Configure WordPress.
Search Engine Optimisation No Point having a lovely site and lovely content if no one can find it!
© 2005 Stephan M Spencer Netconcepts Search Engine Optimisation: Black Art or Sweet Science?
Search Engine Optimization
Managing State Chapter 13.
Search Engine Optimization
Unix System Administration
SEO Mistakes Most Bloggers Make
301 Redirect: How Do I Love You, Let Me Count the Ways
Presentation transcript:

© 2008 Stephan M Spencer Netconcepts Unraveling URLs and Demystifying Domains presented by Stephan Spencer, Founder & President, Netconcepts

© 2008 Stephan M Spencer Netconcepts Subdomains vs. Subdirectories  Matt's/Google's announcement – they'd essentially treat them as the same ( subdirectories/)  You shouldn't treat subdomains as a means of creating tons of easy thin-content microsites. They're being viewed as subdirectories. Yes, use them for managing your website and doing load balancing. No, don't use them purely for SEO reasons.

© 2008 Stephan M Spencer Netconcepts Microsites  Can be bad for your SEO if overly numerous or if they contain substantial amounts of duplicate content (merely changing the UI doesn’t count)  Can be good when you’ll get more link love –Hyphothetical example: stayinghealthy.com vs. stayinghealthy.metlife.com  Can also be beneficial in terms of demographic targeting and focused keyword targeting

© 2008 Stephan M Spencer Netconcepts Keywords in URLs  Beneficial in Google regardless of whether in filename/directory/subdirectory names versus variable values in querystrings.  In other search engines, more important that the keyword be in filename/directory/subdirectory. And, the closer the keyword(s) are to the root domain name, apparently the more weight they will lend.  Just because a keyword is bolded in the SERP doesn’t mean it’s given extra weight in the ranking algo.

© 2008 Stephan M Spencer Netconcepts Word Separators in URLs  Hyphens are the best. Preferred over underscores. –Historically to Google underscores were not word separators –Bare spaces cannot be used in URLs. Character encoded equivalents for "white space" character are + or %20. (e.g. blue%20widgets.htm). Regardless, hyphen is preferred.  Too much of a good thing looks like keyword stuffing –Aim for fewer than a half dozen words (i.e. <5 hyphens) –See my Matt Cutts interview (stephanspencer.com/search-engines/matt-cutts-interview)

© 2008 Stephan M Spencer Netconcepts URL Stability  An annually recurring feature, like a Holiday Gift Buying Guide, should have a stable URL –When the current edition is to be retired and replaced with a new edition, assign a new URL to the archived edition  Otherwise link juice earned over time is not carried over to future years’ editions

© 2008 Stephan M Spencer Netconcepts Domain Age and Expiry  Crusty old domains (and crusty old sites) are more trusted by Google, as alluded to in Google’s "Information retrieval based on historical data” patentInformation retrieval based on historical data –Parked domains aren’t as trusted. Start the clock running.  Number of years that your domain name has before expiring may very well be a big quality indicator. –Suggest increasing the registration period for your domain so the expiration date will be further in the future –Particularly for newer domains

© 2008 Stephan M Spencer Netconcepts Domain Age and Expiry –Domainers often have been known to do "tasting” (i.e. registering domains for just a couple of days to see what keyword traffic they get) –Google just announced that they'll stop displaying AdSense ads on domain tasting sites as a measure to try to fight the practice ( = )

© 2008 Stephan M Spencer Netconcepts Rewriting Your Spider-Unfriendly URLs  3 approaches: 1)Use a “URL rewriting” server module / plugin – such as mod_rewrite for Apache, or ISAPI_Rewrite for IIS Server 2)Recode your scripts to extract variables out of the “path_info” part of the URL instead of the “query_string” 3)Or, if IT department involvement must be minimized, use a proxy server based solution (e.g. Netconcepts' GravityStream) –With (1) and (2), replace all occurrences of your old URLs in links on your site with your new search-friendly URLs. 301 redirect the old to new URLs too, so no link juice is lost.

© 2008 Stephan M Spencer Netconcepts Let’s Geek Out!

© 2008 Stephan M Spencer Netconcepts URL Rewriting – Under the Hood  If running Apache, place “rules” within.htaccess or your Apache config file (e.g. httpd.conf, sites_conf/…) –RewriteEngine on –RewriteBase / –RewriteRule ^products/([0-9]+)/?$ /get_product.php?id=$1 [L] –RewriteRule ^([^/]+)/([^/]+)\.htm$ /webapp/wcs/stores/servlet/ProductDisplay?storeId=10001&c atalogId=10001&langId=-1 &categoryID=$1&productID=$2 [QSA,P,L]

© 2008 Stephan M Spencer Netconcepts URL Rewriting – Under the Hood  The magic of regular expressions / pattern matching –* means 0 or more of the immediately preceding character –+ means 1 or more of the immediately preceding character –? means 0 or 1 occurrence of the immediately preceding char –^ means the beginning of the string, $ means the end of it –. means any character (i.e. wildcard) –\ “escapes” the character that follows, e.g. \. means dot –[ ] is for character ranges, e.g. [A-Za-z]. –^ inside [] brackets means “not”, e.g. [^/]

© 2008 Stephan M Spencer Netconcepts URL Rewriting – Under the Hood –() puts whatever is wrapped within it into memory –Access what’s in memory with $1 (what’s in first set of parens), $2 (what’s in second set of parens), and so on  Regular expression gotchas to beware of: –“Greedy” expressions. Use [^ instead of.* –.* can match on nothing. Use.+ instead –Unintentional substring matches because ^ or $ wasn’t specified

© 2008 Stephan M Spencer Netconcepts URL Rewriting – Under the Hood  Proxy page using [P] flag –RewriteRule /blah\.html$ [P]  [QSA] flag is for when you don’t want query string params dropped (like when you want a tracking param preserved)  [L] flag saves on server processing  Got a huge pile of rewrites? Use RewriteMap and have a lookup table as a text file

© 2008 Stephan M Spencer Netconcepts If You’re on Microsoft IIS Server  ISAPI_Rewrite not that different from mod_rewrite  In httpd.ini : –[ISAPI_Rewrite] RewriteRule ^/category/([0-9]+)\.htm$ /index.asp?PageAction=VIEWCATS&Category=$1 [L] –Will rewrite a URL like &Category=207 to something like

© 2008 Stephan M Spencer Netconcepts Redirects – Under the Hood  In.htaccess (or httpd.conf), you can redirect individual URLs, the contents of directories, entire domains… : –Redirect 301 /old_url.htm –Redirect 301 /old_dir/ –Redirect 301 /  Pattern matching can be done with RedirectMatch 301 –RedirectMatch 301 ^/(.+)/index\.html$

© 2008 Stephan M Spencer Netconcepts Redirects – Under the Hood  Or use a rewrite rule with the [R=301] flag –RewriteCond %{HTTP_HOST} !^www\.example\.com$ [NC] –RewriteRule ^(.*)$ [L,QSA,R=301]  [NC] flag makes the rewrite condition case-insensitive

© 2008 Stephan M Spencer Netconcepts Conditional Redirects, Under the Hood  Selectively redirect bots that request URLs with session IDs to the URL sans session ID: –RewriteCond %{QUERY_STRING} PHPSESSID RewriteCond %{HTTP_USER_AGENT} Googlebot.* [OR] RewriteCond %{HTTP_USER_AGENT} ^msnbot.* [OR] RewriteCond %{HTTP_USER_AGENT} Slurp [OR] RewriteCond %{HTTP_USER_AGENT} Ask\ Jeeves RewriteRule ^/(.*)$ /$1 [R=301,L]  Utilize browscap.ini instead of having to keep up with each spider’s name and version changes

© 2008 Stephan M Spencer Netconcepts URLs that Lead to Error Pages  Traditional approach is to serve up a 404, which drops that obsolete or wrong URL out of the search indexes. This squanders the link juice to that page.  But what if you return a 200 status code instead, so that the spiders follow the links? Then include a meta robots noindex so the error page itself doesn’t get indexed.  Or do a 301 redirect to something valuable (e.g. your home page) and dynamically include a small error notice?

© 2008 Stephan M Spencer Netconcepts Thanks!  This Powerpoint can be downloaded from  For 180 minute long screencast (including 90 minutes of Q&A) on SEO for large dynamic websites (taught by myself and Chris Smith) – including transcripts –  Questions after the show? me at