
Bot IP: 117.50.213.40

Bot Name: ChatGLM Spider
ChatGLM Spider is a web crawler associated with the ChatGLM project, likely used for gathering web data to improve or train large language models developed by Tsinghua University's GLM (General Language Model) team.
Unverified
Track-A-Bot Trust Score: 41/100
Why Track-A-Bot rated this IP

Identity

  • ✅ Marked as bot traffic
  • ✅ Bot name present
  • ✅ User-agent matches bot name
  • ✅ User-agent includes identity link/email

Hostname

  • ⚠️ No hostname observed
  • ℹ️ Hostname does not resemble bot name
  • ℹ️ Hostname status not verified
  • ✅ No hostname mismatch flags

Behavior

  • ℹ️ No /robots.txt request observed
  • ✅ No referrer (typical crawler behavior)
  • ⚠️ Many non-200/304 responses
  • 📌 Visits: 1
  • ⏱️ Age: 0h

Penalties

  • ✅ No major spoof pattern
  • ✅ No fingerprint fields
  • ✅ Cookies not enabled
  • ✅ No connection type
Total visits: 1
First seen: Sep 21st, 2024 7:01 AM
Last seen: Sep 21st, 2024 7:01 AM
Known User Agent(s)
Mozilla/5.0 (compatible; ChatGLM-Spider/1.0; +https://chatglm.cn/)
Known Accepted Language(s)
zh-CN,zh;q=0.9,en;q=0.8,zh-TW;q=0.7
Known Accepted Encoding(s)
gzip, deflate, br

Common User-Agent Tokens

These are identifiers found inside user-agent strings we observed for chatglm-spider.

Token                Seen   First seen   Last seen
chatglm-spider/1.0   50     6/30/24      9/26/24

Identity Links

These are URLs we observed in association with chatglm-spider (often used as an identity, help, or policy link).

URL           Seen   First seen   Last seen
chatglm.cn/   50     6/30/24      9/26/24
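
The token and identity link listed above can both be read from the user-agent string shown under Known User Agent(s). The following is a minimal sketch of that extraction, assuming the common "(compatible; Name/Version; +URL)" layout. The regular expressions and the function name parse_bot_user_agent are illustrative and are not Track-A-Bot's actual parser.

```python
import re

# User-agent string copied from the "Known User Agent(s)" entry on this page.
UA = "Mozilla/5.0 (compatible; ChatGLM-Spider/1.0; +https://chatglm.cn/)"

# Assumption: the bot token is the first "name/version" item after "compatible;".
TOKEN_RE = re.compile(r"\(compatible;\s*([^;()\s]+/[^;()\s]+)")
# Assumption: the identity link is a URL prefixed with "+".
LINK_RE = re.compile(r"\+(https?://[^\s;)]+)")


def parse_bot_user_agent(ua: str) -> dict:
    """Return the lowercased bot token and the identity link, or None for each if absent."""
    token = TOKEN_RE.search(ua)
    link = LINK_RE.search(ua)
    return {
        "token": token.group(1).lower() if token else None,
        "identity_link": link.group(1) if link else None,
    }


parsed = parse_bot_user_agent(UA)
print(parsed)
```

A user-agent string is supplied by the client and is easy to forge, so a matching token on its own does not prove that a request comes from the named crawler.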

Bot FAQ for this IP


Why is chatglm-spider visiting my website?

The IP address 117.50.213.40, identified by the user agent 'ChatGLM-Spider/1.0', is accessing websites as part of automated web crawling activity. The bot appears to be associated with chatglm.cn, as indicated in its user agent string. Such crawlers typically visit public web pages to index content, gather data for search or AI purposes, or analyze site structure. The specific visit recorded was to a publicly accessible URL, which is common behavior for web crawlers. If this activity is unexpected or unwanted, consider reviewing your site's robots.txt file to specify crawl permissions, monitoring server logs for unusual patterns, and using allow/block lists or firewall rules to control access.
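
As a sketch of the robots.txt suggestion, the snippet below uses Python's standard urllib.robotparser to check how a set of rules would apply to the ChatGLM-Spider token. The robots.txt content, the /private/ path, and the example.com URLs are hypothetical placeholders. Whether this crawler honors robots.txt is not confirmed by the data on this page, which notes that no /robots.txt request was observed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block ChatGLM-Spider everywhere and keep
# /private/ off limits for every other crawler.
ROBOTS_TXT = """\
User-agent: ChatGLM-Spider
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Pass the bare token, not the full "Mozilla/5.0 (...)" string: the parser
# only compares the part of the agent name before the first "/".
chatglm_allowed = is_allowed(ROBOTS_TXT, "ChatGLM-Spider", "https://example.com/page")
other_allowed = is_allowed(ROBOTS_TXT, "SomeOtherBot", "https://example.com/page")
print(chatglm_allowed, other_allowed)
```

Because robots.txt is advisory, a firewall or block-list rule is the fallback if the crawler keeps requesting disallowed paths.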

What does chatglm-spider do?

ChatGLM-Spider is an automated web crawler, as indicated by its user agent. Its primary function is likely to index or analyze publicly available website content, potentially for search, AI training, or data aggregation purposes. The bot identifies itself transparently in the user agent and references chatglm.cn for more information. There is no evidence in the provided data of malicious behavior, but the exact scope of its data collection is not fully documented here. To verify the crawler's intent, check the user agent string in your server logs, perform a reverse DNS lookup on the IP address, and consult the referenced website for additional details.
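
The reverse DNS check mentioned above can be sketched as a forward-confirmed lookup: resolve the IP to a hostname, check the hostname against a domain you trust, then resolve the hostname back and confirm it returns the same IP. This page does not document which hostnames the crawler's operator uses, so the allowed suffix in the usage note is a hypothetical placeholder. The function names are illustrative too.

```python
import socket
from typing import Callable, List, Optional, Sequence


def reverse_dns(ip: str) -> Optional[str]:
    """Return the PTR hostname for ip, or None if the lookup fails."""
    try:
        hostname, _aliases, _addresses = socket.gethostbyaddr(ip)
    except OSError:
        return None
    return hostname


def forward_dns(hostname: str) -> List[str]:
    """Return the IPv4 addresses for hostname, or an empty list on failure."""
    try:
        _name, _aliases, addresses = socket.gethostbyname_ex(hostname)
    except OSError:
        return []
    return addresses


def verify_crawler_ip(
    ip: str,
    allowed_suffixes: Sequence[str],
    reverse: Callable[[str], Optional[str]] = reverse_dns,
    forward: Callable[[str], List[str]] = forward_dns,
) -> str:
    """Classify ip as no-hostname, suffix-mismatch, forward-mismatch, or verified."""
    hostname = reverse(ip)
    if hostname is None:
        return "no-hostname"
    host = hostname.rstrip(".").lower()
    if not any(host == s or host.endswith("." + s) for s in allowed_suffixes):
        return "suffix-mismatch"
    # The forward lookup must map the hostname back to the same IP.
    if ip not in forward(host):
        return "forward-mismatch"
    return "verified"
```

A call such as verify_crawler_ip("117.50.213.40", ["crawler.example"]) performs live DNS queries. Since this page reports that no hostname was observed for the IP, the result may well be "no-hostname", which leaves the crawler's origin unconfirmed rather than disproved.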

Is chatglm-spider safe?

Based on the available information, chatglm-spider appears to be a legitimate web crawler that identifies itself and references an official website. There are no immediate signs of harmful activity in the recent access logs. However, as with any automated traffic, it is important to monitor for excessive requests or unexpected behavior. To ensure safety, consider: 1) Reviewing your server logs for unusual patterns from this IP, 2) Implementing rate limiting to prevent resource overuse, 3) Updating your robots.txt file to control crawler access, 4) Using allow/block lists or web application firewall (WAF) rules if necessary, and 5) Performing a reverse DNS lookup to confirm the crawler's origin.
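
Rate limiting, one of the steps listed above, can be illustrated with a small in-memory sliding-window limiter keyed by client IP. This is a sketch under stated assumptions: the class name, the limit of 2 requests per 10 seconds, and the explicit timestamps are hypothetical values chosen for the demonstration. Production setups usually apply limits at the web server, CDN, or WAF rather than in application code.

```python
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most max_requests per client within the last window_seconds."""

    def __init__(self, max_requests: int, window_seconds: float) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._hits = defaultdict(deque)  # client IP -> timestamps of allowed requests

    def allow(self, client_ip: str, now: float) -> bool:
        hits = self._hits[client_ip]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False
        hits.append(now)
        return True


# Hypothetical limit: 2 requests per 10 seconds for each IP.
limiter = SlidingWindowLimiter(max_requests=2, window_seconds=10.0)
decisions = [limiter.allow("117.50.213.40", t) for t in (0.0, 1.0, 2.0, 10.0)]
print(decisions)
```

Passing the timestamp in explicitly keeps the example deterministic. In a live handler, time.monotonic() would be a natural source for now.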

This page provides a detailed lookup for the bot with the IP address of 117.50.213.40, based on real-world traffic observed across websites using Track-A-Bot software. Use this information to verify whether activity from this IP belongs to a legitimate search engine crawler, or to analyze bot behavior on your website.