Technical Information

Alttagger Crawling Bot

Understanding how our image discovery technology works

The Alttagger crawling bot is an automated system that discovers and processes images on your website to enable our alt text generation service. This page explains how our bot works, what data it collects, and how it interacts with your website infrastructure.

Our bot is designed to be respectful of your website's resources while effectively finding all the images that need alt text. It follows industry best practices for web crawling and is continuously improved to provide the best service possible.

How Our Bot Works

The Alttagger crawling bot performs several functions to discover and process images on your website:

  • Discovers pages on your website by following links
  • Identifies images on each page that are missing alt text or have inadequate alt text
  • Collects basic information about these images (URL, dimensions, context)
  • Sends this information to our AI system for alt text generation
  • Monitors website changes to identify new images

The bot only crawls websites that have been registered with Alttagger by an authorized user. It won't crawl your website unless you've specifically set up our service and implemented our code snippet.

Allow Alttagger in Your CDN and Security Platforms

Many websites use Content Delivery Networks (CDNs) and Web Application Firewalls (WAFs) for improved performance and security. These systems may restrict or block crawler access, including Alttagger's bot.

We provide specific configuration instructions for common CDN and security platforms to ensure Alttagger can properly crawl your website:

If you use a different CDN or security platform and are experiencing issues with our crawler, please contact our support team for assistance.

Crawling Methods

Alttagger uses two primary methods to crawl websites, depending on the technical requirements of your site:

Standard Crawling

For static websites and websites with server-rendered content, our standard crawler efficiently processes pages to discover images. This method is lightweight and minimizes server load.

JavaScript-Enabled Crawling

For websites with JavaScript-rendered content (SPAs, React, Vue, Angular, etc.), we use a headless browser approach that fully executes JavaScript to ensure we discover all dynamically loaded images. Our system automatically detects when this method is required.

Both methods are designed to be efficient and respectful of your server resources. We employ intelligent crawling patterns to minimize the impact on your website's performance.

IP Addresses

Our crawling bot operates from the following IP addresses:

  • 23.88.34.126
  • 78.47.224.33

You can recognize our bot by its user agent string, which includes: AltTagger/1.0 (+https://alttagger.com/bot)

If you notice unusual crawling behavior from these IPs or our user agent, please contact our support team.

Rate Limiting

To ensure our bot doesn't overload your website, we implement several rate-limiting measures:

  • Maximum of 10 concurrent connections per domain
  • Minimum 500ms delay between requests to the same server
  • Adaptive crawling speed based on server response times
  • Automatic pausing if 5xx server errors are detected
  • Respect for robots.txt directives

These limits ensure our crawler is a good citizen on your website while still effectively discovering all relevant images.

Privacy Considerations

Our crawler is designed with privacy in mind:

  • We only collect data specifically related to images and their immediate context
  • We don't store personal data from your website visitors
  • We don't process form submissions or interact with user-input elements
  • We don't access password-protected areas unless specifically configured to do so
  • All data is processed in accordance with our Privacy Policy

Robots.txt

Alttagger respects the robots.txt file by default. However, you can override this setting in the Alttagger dashboard.

Frequently Asked Questions

How often does the bot crawl my website?

Our bot initially performs a comprehensive crawl of your website to discover all existing images. After that, it switches to an incremental approach, checking for new or modified content at regular intervals based on your website's update frequency and your subscription plan.

Will the bot affect my website's performance?

Our bot is designed to have minimal impact on your website's performance. We use adaptive crawling speeds and respect server response times. Most websites experience no noticeable performance impact from our crawling activities.

How does the bot handle dynamic content?

Our JavaScript-enabled crawler can detect and process dynamically loaded content, including images loaded via AJAX or other JavaScript methods. It waits for the page to fully render before analyzing the content.

Can I see what images the bot has discovered?

Yes, you can view all discovered images in your Alttagger dashboard. This gives you complete visibility into what our bot has found and allows you to manage alt text for each image.

What if I need more help?

If you have additional questions about our crawling bot or need technical assistance, please contact our support team.