Comments (12)
👍
Something to discuss with @alexmakarski
from st2docs.
Agreed. Need to have banner across top of old docs saying "THESE ARE OLD" or something like that - see the way http://docs.influxdata.com/influxdb/v1.1/ does it.
Probably needs robots.txt or similar updated to stop Google indexing old versions?
from st2docs.
Personally I would prefer to make it difficult to even land on an old page. So I'd do both - put up a big banner saying "this is old", similar to how influx does it, but in addition, force the user to go out of their way to get to an old doc. So they won't even get there unless they're meaningfully doing it.
Even worse - and I haven't had time to dig into why this is - our docs will display an old version of the docs even if the URL says "latest":
I got to this page simply by googling "stackstorm actions". That definitely shouldn't happen
from st2docs.
@LindsayHill I wouldn't hastily de-index the old stuff, especially if it's bringing in traffic. I think the Influx approach is more sound.
from st2docs.
@Mierdin I think that you got those old versions because of some page re-arrangement we did. What was /runners.html got moved to /reference/runners.html. It looks like the build processes are only additive though - they're not deleting previously built content.
from st2docs.
If the old stuff is bringing in traffic, is that really traffic that we want? At best, those situations are providing two google results - one with the correct version, and one with the old version - forcing the user to choose. At worst, users are finding references to concepts that have either radically changed or simply don't exist anymore. I don't think that traffic is what we're after.
@LindsayHill is probably right, and we just need to clean up the old stuff somehow
from st2docs.
I don't know if the cleanup is something easily automated (e.g. do a make clean; make docs). Depends how the sync with S3 step is done I guess - I don't know the details of that workflow
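One way to picture the cleanup, as a minimal sketch only: compare what is currently deployed with what a fresh build produces, and treat the difference as deletion candidates. The function name, the key lists, and the idea of protecting versioned prefixes are assumptions for illustration; the actual S3 sync workflow is not described in this thread.

```python
def stale_keys(deployed_keys, built_keys, protected_prefixes=()):
    """Return deployed paths that a fresh build no longer produces.

    protected_prefixes is a hypothetical safeguard: paths under these
    prefixes (for example versioned directories built from other
    branches) are never reported as stale.
    """
    built = set(built_keys)
    protected = tuple(protected_prefixes)
    return sorted(
        key
        for key in deployed_keys
        if key not in built and not key.startswith(protected)
    )


# Example using the page move mentioned in this thread.
deployed = [
    "actions.html",
    "runners.html",            # left behind after the move
    "reference/runners.html",
    "1.6/runners.html",        # belongs to an old release branch
]
built = ["actions.html", "reference/runners.html"]
candidates = stale_keys(deployed, built, protected_prefixes=("1.6/",))
print(candidates)
```

The result is only a list of candidates; whether to delete them or replace them with redirects is a separate decision.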
from st2docs.
This robots.txt should work to disallow Google indexing versioned doc URLs:
User-agent: *
Disallow: /0.*/
Disallow: /1.*/
Disallow: /2.*/
Disallow: /3.*/
Disallow: /4.*/
Disallow: /5.*/
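To sanity-check which paths those rules would cover, here is a rough sketch of wildcard matching as crawlers that support `*` are generally documented to do it: `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a prefix match. It is a simplified model written for this thread, not a full robots.txt parser, and it ignores `Allow` rules and rule precedence.

```python
import re

# The Disallow paths proposed above.
DISALLOW = ["/0.*/", "/1.*/", "/2.*/", "/3.*/", "/4.*/", "/5.*/"]


def rule_to_regex(rule_path):
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape literal text (so "." stays a literal dot), then join on ".*".
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(url_path, rules=DISALLOW):
    """True if any Disallow pattern matches the start of url_path."""
    return any(rule_to_regex(rule).match(url_path) for rule in rules)


for path in ["/1.6/actions.html", "/actions.html", "/latest/actions.html"]:
    print(path, is_blocked(path))
```

Under this model, version-prefixed paths such as /1.6/actions.html are blocked while the unversioned and /latest paths are not.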
from st2docs.
We could look at using the <priority> tags in sitemap.xml - see https://www.sitemaps.org/protocol.html
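For reference, a small sketch of what generating such a sitemap could look like. The sitemap protocol defines `<priority>` as a value from 0.0 to 1.0, relative to other URLs on the same site. The base URL scheme, the page list, and the chosen priority values below are assumptions for illustration, and search engines treat the value as a hint at most.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(base_url, pages):
    """Build sitemap XML from (path, priority) pairs.

    priority must be between 0.0 and 1.0, per the sitemap protocol.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for path, priority in pages:
        if not 0.0 <= priority <= 1.0:
            raise ValueError(f"priority out of range for {path}: {priority}")
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = base_url.rstrip("/") + "/" + path.lstrip("/")
        ET.SubElement(url, "priority").text = f"{priority:.1f}"
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical values: current docs high, version-specific docs low.
sitemap_xml = build_sitemap(
    "https://docs.stackstorm.com",
    [("actions.html", 1.0), ("1.6/actions.html", 0.1)],
)
print(sitemap_xml)
```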
from st2docs.
I think there's 3 separate, but related issues here. These are:
- Some doc pages have been moved/deleted over time, especially around v1.6->2.0 transition. The docs build & deployment process does not delete those pages, so we end up with things like an old version of docs.stackstorm.com/policies.html and the updated content at docs.stackstorm.com/reference/policies.html
- On our old docs pages, we don't give users any indication that they are looking at outdated content.
- Google is indexing version-specific docs, e.g. docs.stackstorm.com/1.6/actions.html, rather than docs.stackstorm.com/actions.html. We need to get Google to prefer to use the current docs, or possibly /latest (see more below).
This is what I think we need to do:
- Create placeholder pages containing the equivalent of <meta http-equiv="Refresh" content="0; url=http://www.example.com/" /> for those pages that have been moved. Ideally do this in reStructuredText, so output gets auto-generated. Or we could do it via AWS API for S3 - see https://medium.com/cloud-io/put-redirect-301-in-s3-cloudfront-hosted-website-39bd9104ddbc. This would be a one-off manual task. Would require doing again in future if any other pages get moved. By putting in the 301 redirects, we'll steer Google & users to new content, and not lose rank. #453
- Figure out some way to auto-insert a 'this doc is old, go to location instead' banner in old docs branches. Needs to be something that gets done when we go through release branch process, as last change made to old version. - #455
- I think there's 3 options here for steering Google towards latest docs:
  - Configure robots.txt as above, to disallow searching under version-specific branches
  - Configure Canonical URL in our docs, point that to docs.stackstorm.com/. Should be able to do that with our theme.
  - Configure priority in sitemap.xml. Not sure if I can auto-generate that though.
  I think we should start with the second option. Relatively low impact. I'm not against also configuring robots.txt.
Related: I've been wondering if we should use /latest as the latest stable docs branch, not the dev branch. Then we could have canonical URLs, etc pointing to /latest. Not sure on that.
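The placeholder-page idea from the first item could be sketched roughly as follows. This is an illustration under assumptions, not the approach actually taken in #453: the function name, the output directory, and the https base URL are hypothetical, and the old-to-new mapping uses only the two moves mentioned in this thread. Note that a meta refresh is a client-side redirect rather than an HTTP 301, so the stub also carries a canonical link pointing at the new location.

```python
import html
from pathlib import Path

STUB = """<!DOCTYPE html>
<html>
  <head>
    <meta http-equiv="Refresh" content="0; url={target}" />
    <link rel="canonical" href="{target}" />
    <title>Page moved</title>
  </head>
  <body><p>This page has moved to <a href="{target}">{target}</a>.</p></body>
</html>
"""


def write_redirect_stubs(moved, out_dir, base_url):
    """Write one stub page per moved path; return the files written.

    moved maps an old relative path to its new relative path.
    """
    written = []
    for old_path, new_path in moved.items():
        target = html.escape(
            base_url.rstrip("/") + "/" + new_path.lstrip("/"), quote=True
        )
        stub_file = Path(out_dir) / old_path
        stub_file.parent.mkdir(parents=True, exist_ok=True)
        stub_file.write_text(STUB.format(target=target), encoding="utf-8")
        written.append(stub_file)
    return written


# Page moves mentioned in this thread.
MOVED = {
    "runners.html": "reference/runners.html",
    "policies.html": "reference/policies.html",
}
```

Calling `write_redirect_stubs(MOVED, "build/html", "https://docs.stackstorm.com")` after the docs build would drop the stubs next to the generated pages; the mapping would need extending whenever more pages move.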
from st2docs.
PR submitted to address item 2, redirects #453
Now I just need to figure out banners on old content...
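One possible shape for the banner step, as a sketch only: post-process the built HTML of an old branch and inject a notice right after the opening body tag. The CSS class name, banner wording, and function name are made up for illustration; doing this in the Sphinx theme templates instead may well be cleaner.

```python
import re

# Hypothetical banner markup; class name and wording are placeholders.
BANNER_TEMPLATE = (
    '<div class="old-docs-banner">'
    "This documentation is for an old version. "
    'See the <a href="{url}">current docs</a>.'
    "</div>"
)


def add_banner(page_html, current_url):
    """Insert the banner after the first <body ...> tag, at most once."""
    if 'class="old-docs-banner"' in page_html:
        return page_html  # already processed
    banner = BANNER_TEMPLATE.format(url=current_url)
    return re.sub(
        r"(<body\b[^>]*>)",
        lambda match: match.group(1) + banner,
        page_html,
        count=1,
        flags=re.IGNORECASE,
    )


page = '<html><body class="wy-body"><p>Old content</p></body></html>'
patched = add_banner(page, "https://docs.stackstorm.com/actions.html")
print(patched)
```

Running it twice on the same page leaves the page unchanged the second time, which matters if the step runs on every build of a release branch.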
from st2docs.
A bunch of pull requests have gone in to address this, and things are looking much better. Haven't done final testing with the version picker, but this is looking much happier. Google searches are now leaning towards the current docs, and redirects are in place.
from st2docs.