Wikipedia:Ligazóns mortas: Diferenzas entre revisións

Explorar o historial de forma interactiva

← Edición máis vella Edición máis nova →

Contido eliminado Contido engadido

En liña

Revisión como estaba o 6 de xuño de 2016 ás 07:57

Esta páxina ou sección está a editarse nestes intres. Para evitar posibles conflitos de edición, non edites esta páxina ou sección mentres vexas esta mensaxe. Revisa o historial de edicións para saber quen traballa nela. O usuario Banjo (conversa · contribucións) realizou a última edición na páxina hai 7 anos.
O tempo máximo de presenza deste marcador é dun mes dende a última edición do usuario que o puxo; pasado ese tempo debe retirarse.

Atallo: WP:LM

Esta páxina en resumo: As ligazóns mortas poden prexudicar as referencias, pero existen métodos dispoñibles para reducir ou reparar este tipo de problemas. Non se debe eliminar información referenciada polo único motivo de que a URL da fonte non estea xa dispoñible.

Ó igual que moitas outras grandes páxinas web, a Wikipedia sofre o fenómeno coñecido como ligazóns mortas ou podremia de ligazóns (inglés: link rot). As ligazóns externas, habitualmente empregadas como referencias e citas, pasan a ser gradualmente irrelevantes ou rotas (tamén chamadas mortas), segundo as webs ligadas desaparecen, cambian o seu contido ou trasládanse de dominio ou dirección. Isto presenta unha ameaza para a fiabilidade da Wikipedia e para os métodos de citar as fontes.

O esforzo requirido para previr as ligazóns mortas é significativamente menor que o esforzo requirido para reparar ou mitigar unha ligazón morta. Polo tanto, a prevención das ligazóns mortas fortaleze a enciclopedia. Esta páxina fornece estratexias e métodos para previr as ligazóns mortas antes de que aparezan, incluíndo o uso de servizos de arquivo web e o uso correcto dos modelos de citas. Recoméndase tamén engadir unha ligazón de arquivo como parte de cada referencia web, ou enviar a URL referenciada para o seu arquivo,^{[note 1]} no momento en que se crea ou actualiza unha referencia.

Porén, as ligazóns mortas non poden previrse sempre, polo que tamén se explica nesta páxina como mitigar este problema atopando ligazóns arquivadas previamente e outras fontes alternativas. Non se debe eliminar información referenciada soamente porque a URL da fonte non estea xa dispoñible. A verificabilidade non require que tódala información referenciada inclúa unha ligazón dispoñible, nin require que a fonte estea publicada en liña. Agás as URLs das seccións de ligazóns externas non usadas para referenciar contido nos artigos, non se debe eliminar unha URL soamente polo feito de que non estea xa dispoñible.

Previr as ligazóns mortas

Ó escribir e desenvolver un artigo, pódense previr as ligazóns mortas de varias formas. A primeira forma é evitar as URLs baleiras, indicando o título, autor, editorial, data e calquera outra información relevante da fonte. Así, se a ligazón deixa de funcionar a información engadida poderá axudar ós futuros Wikipedistas a localizar unha nova fonte para o texto orixinal, ben en liña ou ben nunha copia impresa. Isto resulta moi complicado ou imposible cunha URL baleira illada que non achega información adicional da fonte de información.

Servizos de arquivo web

A second way to prevent link rot is to use a web archiving service. The two most popular services are the Wayback Machine, which crawls and archives many web pages as well as having a form to suggest a URL to be archived,^{[note 1]} and WebCite, which provides on-demand web archiving. These services collect and preserve web pages for future use even if the original web page is moved, changed, deleted, or placed behind a pay wall. Web archiving is especially important when citing web pages that are unstable or prone to changes, like time sensitive news articles or pages hosted by financially distressed organizations. Once you have the URL for the archived version of the web page, use the archiveurl= and archivedate= parameters in the citation template that you are using. The template will automatically incorporate the archived link into reference.

Dubner, Stephen J. (January 24, 2008). "Wall Street Journal Paywall Sturdier Than Suspected". The New York Times Company. Consultado o 2009-10-28.
Dubner, Stephen J. (January 24, 2008). "Wall Street Journal Paywall Sturdier Than Suspected". The New York Times Company. Arquivado dende o orixinal o 2011-08-15.

However, not every web page can be archived. Webmasters and publishers may use a Robots exclusion standard in their domain to disallow archiving, or rely on complicated JavaScript, Flash, or other code that can't easily be copied. In these cases, alternate methods of preserving the data may be available.

Robots.txt

A quirk in the way the Wayback Machine operates means archived copies of sites sometimes become unavailable, for example, the Freakonomics blog previously hosted at freakonomics.blogs.nytimes.com. Those URLs were then excluded from archiving by the New York Times' robots.txt file. This had the unfortunate effect of also rendering the previously archived content unavailable. But robots.txt changes can unhide that which previous changes have hidden, so do not delete an archiveURL solely because the archived content is currently unavailable. Luckily, in this case, not only can the content be found on a new site that is still open to archiving, but the site's robots.txt later changed to allow archiving again, and so the old archives are now unhidden (example).

Métodos alternativos

Most citation templates have a quote= parameter that can be used to store text quotes of the source material. This can be used to store a limited amount of text from the source within the citation template. This is especially useful for sources that cannot be archived with web archiving services. It can also provide insurance against failure of the chosen web archiving service.

Dubner, Stephen J. (January 24, 2008). "Wall Street Journal Paywall Sturdier Than Suspected". The New York Times Company. Arquivado dende o orixinal o 2008-04-30. ...the Wall Street Journal will not, as has been widely speculated, tear down its paywall entirely...

When using the quote parameter, choose the most succinct and relevant material possible that preserves the context of the reference. Storing the entire text of the source is not appropriate under fair use policies, so choose only the most important portions of the text that most support the assertions in the Wikipedia article.

A quote also helps searching for other on-line versions of the source in the event that the original is discontinued.

Where applicable, public domain materials can be copied to Wikisource.

Repairing a dead link

Atallo: WP:DEADLINK

There are several ways to try to repair a dead link, detailed below:

Site search

Often web pages have simply moved, either in connection with a migration to a new server, or through general site maintenance. A site index or site-specific search feature is a useful place to locate the moved page.

Search engine

A search engine query using the title of the page, possibly with a search restriction to the same site, might also find the page. Using the examples from above, a web search (such as Google, Yahoo, etc.) might look like one of these:

site:freakonomics.blogs.nytimes.com/ "Wall Street Journal Paywall Sturdier Than Suspected"

site:nytimes.com/ "Wall Street Journal Paywall Sturdier Than Suspected"

"Wall Street Journal Paywall Sturdier Than Suspected"

Also, a search for some components of the dead link with punctuation removed is often fruitful; e.g. a search through Google for

groups.csail.mit.edu sFFT paper pdf

leads to a page enabling Modelo:Diff. A search for an unusual or unique-looking substring of the URL, such as just the filename at the end, is often fruitful.

Internet archives

Check for archived versions of the page in the archiving services. If you find an archived version of the dead link, double-check to make sure that citation still supports the article text. It is also a good idea to consult the access date of the citation (if it was specified, or a history search for when it was added) to see how contemporaneous this archived version is to the link when it was cited.

The following archiving services are considered to be reliable:

The Mementos interface allows you to search multiple archiving services for archived versions of some pages with a single request using the Memento protocol. Unfortunately, the Mementos webpage interface removes any parameters which are included with the URL. If the URL contains a "?" it is unlikely to work properly when entered manually without changes. When entering the URL into the Mementos interface manually, the most common change needed is to change "?" to "%3F". While making only this change will not be sufficient in all cases, it will work most of the time. The bookmarklet in the table below will properly encode URLs such that searches will work. Mementos looks like it is, or at least will be, very convenient. However, if archives are not found at Mementos, it should not be the only site checked. Mementos can sometimes return no results when archives exist at sites which it normally includes. An example of this is trying to find archives of Battle of the Atlantic. Modelo:As of, Archive.org reports it has 63 or 64 archives (https, http). Mementos reports 0 archives (https, http). Mementos usually finds archives at Archive.org, but sometimes Mementos does not even when archives exist. If you try Mementos first, don't assume that there really are no archives if Mementos reports that there are none.

There are many Internet archive projects in existence.

When multiple archive dates are available, try to use the one that is most likely to be the contents of the page seen by the editor who entered the reference on the |accessdate=. If that parameter is not specified, a search of the article's revision history can be performed to determine when the link was added to the article.

View the archive to verify that it contains valid page information. Sometimes archives are actually archives of the fact that the link is dead, or that the archiving failed. If this is the case, try using an archive from a different date. Usually dates closer to the time the link was placed in the Wikipedia page, or earlier, are more likely to show valid information. Different archiving sites should also be tried.

For most citation templates, archives are entered using the |archiveurl=, |archivedate= and |deadurl= parameters. The primary link is automatically switched to the archive when |deadurl=yes. This retains the original link location for reference.

Bookmarklets to check common archive sites for archives of the current page
(all open in a new tab or window)
Archive site	Bookmarklet
Archive.org	javascript:void(window.open('https://web.archive.org/web/*/'+location.href))
UKGWA	javascript:void(window.open('http://webarchive.nationalarchives.gov.uk/*/'+location.href))
WebCite	javascript:void(window.open('http://www.webcitation.org/query.php?url='+location.href))
Wikiwix	javascript:void(window.open('http://archive.wikiwix.com/cache/?url='+location.href))
Mementos interface	javascript:void(window.open('http://www.webarchive.org.uk/mementos/search/'+encodeURIComponent(location.href)+'?referrer='+encodeURIComponent(document.referrer)))

The following archiving services are not permitted on the English Wikipedia:

archive.is See WP:Archive.is RFC and WP:Archive.is RFC 3 for more information.

Mitigating a dead link

Atallo: WP:MDLI

At times, all attempts to repair the link will be unsuccessful. In that event, consider finding an alternate source so that the loss of the original does not harm the verifiability of the article. Alternate sources about broad topics are usually easily located. A simple search engine query might locate an appropriate alternative, but be extremely careful to avoid citing mirrors and forks of Wikipedia itself, which would violate Wikipedia:Verifiability.

Sometimes, finding an appropriate source is not possible, or would require more extensive research techniques, such as a visit to a library or the use of a subscription-based database. If that is the case, consider consulting with Wikipedia editors at Wikipedia:WikiProject Resource Exchange, the Wikipedia:Village pump, or Wikipedia:Help desk. Also, consider contacting experts or other interested editors at a relevant WikiProject.

Keeping dead links

Atallo: WP:KDL

A dead, unarchived source URL may still be useful. Such a link indicates that information was (probably) verifiable in the past, and the link might provide another user with greater resources or expertise with enough information to find the reference. It could also return from the dead. With a dead link, it is possible to determine if it has been cited elsewhere, or to contact the person originally responsible for the source. For example, one could contact the Yale Computer Science department if http://www.cs.yale.edu/~EliYale/Defense-in-Depth-PhD-thesis.pdf^{[Ligazón morta]} were dead. Place {{dead link}} after the dead URL and just before the </ref> tag if applicable, leaving the original link intact. Placing {{dead link}} auto-categorizes the article into Articles with dead external links project category, and into specific monthly date range category based on |date= parameter. Do not delete a URL just because it has been tagged with {{dead link}} for a long time.

Notas

↑ ^1,0 ^1,1 Using the web form at https://archive.org, enter a URL and click "browse history". This will either redirect to show the latest previously archived copy, present a box near the bottom of the page with a link inviting the user to "save this URL in the Wayback Machine", display a calendar showing the extent of previously archived content for that URL, or show an error message explaining why the URL cannot be archived. If archiving is attempted and ultimately successful, the archived copy usually becomes available within minutes.
Alternately, you can use the bookmarklets listed at Wikipedia:Citing sources/Further considerations#Archiving bookmarklets. The bookmarklets enable you to cause the page that you are viewing to be archived with a single click. A new tab will open with the progress of the archiving without disturbing the tab you are using to view the to-be-archived page. Bookmarklets are available for both Archive.org (the Wayback Machine) and WebCite.

[ia_form-1] 1,0 ^1,1 Using the web form at https://archive.org, enter a URL and click "browse history". This will either redirect to show the latest previously archived copy, present a box near the bottom of the page with a link inviting the user to "save this URL in the Wayback Machine", display a calendar showing the extent of previously archived content for that URL, or show an error message explaining why the URL cannot be archived. If archiving is attempted and ultimately successful, the archived copy usually becomes available within minutes.
Alternately, you can use the bookmarklets listed at Wikipedia:Citing sources/Further considerations#Archiving bookmarklets. The bookmarklets enable you to cause the page that you are viewing to be archived with a single click. A new tab will open with the progress of the archiving without disturbing the tab you are using to view the to-be-archived page. Bookmarklets are available for both Archive.org (the Wayback Machine) and WebCite.

[note 1]

@@ Liña 8: / Liña 8: @@
 Porén, as ligazóns mortas non poden previrse sempre, polo que tamén se explica nesta páxina como mitigar este problema atopando ligazóns arquivadas previamente e outras fontes alternativas. '''Non se debe eliminar''' información referenciada soamente porque a URL da fonte non estea xa dispoñible. ''A [[Wikipedia:Verificabilidade|verificabilidade]] non require que tódala información referenciada inclúa unha ligazón dispoñible, nin require que a fonte estea publicada en liña.'' Agás as URLs das seccións de [[Wikipedia:Ligazóns externas|ligazóns externas]] non usadas para referenciar contido nos artigos, '''non se debe eliminar''' unha URL soamente polo feito de que non estea xa dispoñible.
+==Previr as ligazóns mortas==
-==Preventing link rot==
+Ó escribir e desenvolver un artigo, pódense previr as ligazóns mortas de varias formas. A primeira forma é '''evitar as URLs baleiras''', indicando o título, autor, editorial, data e calquera outra información relevante da fonte. Así, se a ligazón deixa de funcionar a información engadida poderá axudar ós futuros Wikipedistas a localizar unha nova fonte para o texto orixinal, ben en liña ou ben nunha copia impresa. Isto resulta moi complicado ou imposible cunha URL baleira illada que non achega información adicional da fonte de información.
-{{atallo|WP:PLRT}}
-As you [[Wikipedia:Article development|write articles]], you can help prevent link rot in several ways. The first way to prevent link rot is to '''avoid [[Wikipedia:Bare URLs|bare URLs]]''' by recording as much of the exact '''title''', '''author''', '''publisher''' and '''date''' of the source as possible. Optionally, also add the '''accessdate'''. If the link goes bad, this added information can help a future Wikipedian, either editor or reader, locate a new source for the original text, either online or a print copy. This likely wouldn't be possible with only an isolated, bare [[URL]] that no longer worked. Local and school libraries are a good resource for locating such offline sources. Many local libraries have in-house subscriptions to digital databases or inter-library loan agreements, making it easier to retrieve hard-to-find sources.
+===Servizos de arquivo web===
-''As you edit, if an article has bare URLs in its citations, fix them or at least tag the References section with {{tl|linkrot}} as a reminder to complete citation details as above, and to categorize the article as needing cleanup.''
-===Web archive services===
 A second way to prevent link rot is to use a [[web archiving]] service. The two most popular services are the [[Wikipedia:Using the Wayback Machine|Wayback Machine]], which crawls and archives many web pages as well as having a form to suggest a URL to be archived,<ref name="ia_form" group="note">Using the web form at https://archive.org, enter a URL and click "browse history". This will either redirect to show the latest previously archived copy, present a box near the bottom of the page with a link inviting the user to "save this URL in the Wayback Machine", display a calendar showing the extent of previously archived content for that URL, or show an error message explaining why the URL cannot be archived. If archiving is attempted and ultimately successful, the archived copy usually becomes available within minutes.<br/>
 Alternately, you can use the bookmarklets listed at [[Wikipedia:Citing sources/Further considerations#Archiving bookmarklets]]. The bookmarklets enable you to cause the page that you are viewing to be archived with a single click. A new tab will open with the progress of the archiving without disturbing the tab you are using to view the to-be-archived page. Bookmarklets are available for both Archive.org (the Wayback Machine) and WebCite.</ref> and [[Wikipedia:Using WebCite|WebCite]], which provides on-demand web archiving. These services collect and preserve web pages for future use even if the original web page is moved, changed, deleted, or placed behind a [[pay wall]]. Web archiving is especially important when [[WP:Citing sources|citing]] web pages that are unstable or prone to changes, like time sensitive [[news]] articles or pages hosted by financially distressed organizations. Once you have the URL for the archived version of the web page, use the <code>archiveurl=</code> and <code>archivedate=</code> parameters in the [[Wikipedia:Citation templates|citation template]] that you are using. The template will automatically incorporate the archived link into reference.
@@ Liña 24: / Liña 21: @@
 A quirk in the way the Wayback Machine operates means archived copies of sites sometimes become unavailable, for example, the [https://en.wikipedia.org/wiki/Freakonomics#Freakonomics_blog Freakonomics blog] previously hosted at <code>freakonomics.blogs.nytimes.com</code>. Those URLs were [https://web.archive.org/web/20130928002127/http://www.freakonomics.blogs.nytimes.com/robots.txt then] excluded from archiving by the New York Times' robots.txt file. This had the unfortunate effect of also rendering the previously archived content unavailable. But robots.txt changes can unhide that which previous changes have hidden, so do not delete an archiveURL solely because the archived content is currently unavailable.  Luckily, in this case, not only can the content be found on a new site that is still open to archiving, but the site's robots.txt later [https://web.archive.org/web/20131217041219/http://freakonomics.com/robots.txt changed] to allow archiving again, and so the old archives are now unhidden ([https://web.archive.org/web/20070623110922/http://freakonomics.blogs.nytimes.com/ example]).
-===Alternative methods===
+===Métodos alternativos===
 Most [[Wikipedia:Citation templates|citation templates]] have a <code>quote=</code> parameter that can be used to store text quotes of the source material. This can be used to store a limited amount of text from the source within the citation template. This is especially useful for sources that cannot be archived with web archiving services. It can also provide insurance against failure of the chosen web archiving service.