Site urls
Usage
Usually, to get of all site pages, simple enter any of its page in "Site" field and click on "Get site pages" button.
Read the next section, if you can not get the site pages for some reason.
How service works
In most cases, each site has a file, that consists of all inner links and called Sitemap. As a rule, it is located at [site]/sitemap.xml (ex.: vivazzi.pro/sitemap.xml). Using this file this service extract all inner page links.
Also, the service takes into account sites with a large number of pages that have multiple sitemap in one main sitemap. Sometimes the child sitemap files in the main Sitemap file have an extension other than .xml (for example, .zip to archive files). In this case, the service will ignore these files and will issue a corresponding message.
In normally, path to file is specified in [site]/robots.txt in Sitemap
section, for example vivazzi.pro/robots.txt:
User-agent: * Host: https://vivazzi.pro Sitemap: https://vivazzi.pro/sitemap.xml
In rare cases, site developers can use another location for the Sitemap. In this case, the service will try to find the file specified in the robots.txt. If robots.txt is not available or sitemap file specified in robots.txt does not exists, the service can not display site pages, since service does not automatically crawl pages from site's links, as search engines (Google, Yandex and so on) or spider programs (majento, xenu and so on).
If you did not get the site page, try using different spiders, but it's probably hard for an ordinary user to understand.
There is also a way to get all the links of the site through the Google or Yandex search engine by typing in the address bar a query:
site:[site_name]
For example, site:vivazzi.pro (see more details about site:
command on page: Исключить поддомены командой site: в google)
But this method has a disadvantage: displays only those pages that are included in the search , and the remaining pages will be ignored if they did not enter the search (not indexed) for some reason.
Also you can find all the links on the page using different services. For example: pr-cy.ru/link_extractor - will show internal and external links on the page. This service will be of little use if you want to get all the links of the site, because link_extractor does not crawl through all the links on the site.
Comments: 7
22.09.2021 20:13 #
Единственный нормальный сервис! Спасибо
Reply
23.09.2021 2:20 #
Благодарю! Рад, что сервис оказался полезным!
Reply
04.04.2022 20:36 #
Здравствуйте! Возникла такая проблема, что вчера забивала этот сайт timochko.ru здесь и выдавал все страницы, сегодня же пишет, что не найдено ни одной. И месяц назад также все находил, только сегодня возникла проблема и со всеми остальными сайтами, которые искала: ни одной ссылке не нашел. В чем может быть проблема? Забивала по адресу как обычно
Reply
07.04.2022 8:06 #
Добрый день! Да, оказывается ошибка есть. Спасибо, что сообщили! Постараюсь в скором времени разобраться
Reply
07.04.2022 8:20 #
Готово, ошибка исправлена! Приятной работы с сервисом! :)
Reply
07.04.2022 18:32 #
Спасибо большое! Все работает
Reply
31.05.2024 6:46 #
Спасибо за сервис поиска всех страниц сайта! Всё удобно, пользуюсь буквально каждую неделю
Reply