
Bot IP: 117.50.215.163

Bot Name: ChatGLM Spider
ChatGLM Spider is a web crawler associated with the ChatGLM project, likely used for gathering web data to improve or train large language models developed by Tsinghua University's GLM (General Language Model) team.
Unverified
Track-A-Bot Trust Score: 67/100
Why Track-A-Bot rated this IP

Identity

  • ✅ Marked as bot traffic
  • ✅ Bot name present
  • ✅ User-agent matches bot name
  • ✅ User-agent includes identity link/email

Hostname

  • ⚠️ No hostname observed
  • ℹ️ Hostname does not resemble bot name
  • ℹ️ Hostname status not verified
  • ✅ No hostname mismatch flags

Behavior

  • ℹ️ No /robots.txt request observed
  • ✅ No referrer (typical crawler behavior)
  • ✅ Mostly 200/304 responses
  • 📌 Visits: 14
  • ⏱️ Age: 95h

Penalties

  • ✅ No major spoof pattern
  • ✅ No fingerprint fields
  • ✅ Cookies not enabled
  • ✅ No connection type
Total visits: 14
First seen: Aug 5th, 2024 4:03 AM
Last seen: Aug 9th, 2024 3:35 AM
Known User Agent(s)
Mozilla/5.0 (compatible; ChatGLM-Spider/1.0; +https://chatglm.cn/)
Known Accepted Language(s)
zh-CN,zh;q=0.9,en;q=0.8,zh-TW;q=0.7
Known Accepted Encoding(s)
gzip, deflate, br

Common User-Agent Tokens

These are identifiers found inside user-agent strings we observed for chatglm-spider.

Token | Seen | First seen | Last seen
chatglm-spider/1.0 | 50 | 6/30/24 | 9/26/24
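
As a rough illustration of how such tokens can be pulled out of raw user-agent strings, here is a minimal Python sketch. The regular expression and the function names `extract_tokens` and `count_tokens` are assumptions made for this example; they are not Track-A-Bot's actual method.

```python
import re
from collections import Counter

# Hypothetical pattern: a product name, a slash, then a dotted version number,
# e.g. "ChatGLM-Spider/1.0".
PRODUCT_TOKEN = re.compile(r"([A-Za-z][A-Za-z0-9_-]*)/(\d+(?:\.\d+)*)")


def extract_tokens(user_agent: str) -> list[str]:
    """Return lowercased name/version tokens, skipping the generic Mozilla prefix."""
    tokens = []
    for name, version in PRODUCT_TOKEN.findall(user_agent):
        if name.lower() == "mozilla":
            continue  # nearly every client sends this, so it identifies nothing
        tokens.append(f"{name.lower()}/{version}")
    return tokens


def count_tokens(user_agents) -> Counter:
    """Tally how often each token appears across many user-agent strings."""
    counts = Counter()
    for user_agent in user_agents:
        counts.update(extract_tokens(user_agent))
    return counts


UA = "Mozilla/5.0 (compatible; ChatGLM-Spider/1.0; +https://chatglm.cn/)"
print(extract_tokens(UA))
```

Feeding `count_tokens` the user-agent column of an access log would give a per-token tally comparable in spirit to the "Seen" column above.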

Identity Links

These are URLs we observed in association with chatglm-spider (often used as an identity, help, or policy link).

URL | Seen | First seen | Last seen
chatglm.cn/ | 50 | 6/30/24 | 9/26/24

Bot FAQ for this IP


Why is chatglm-spider visiting my website?

The client at 117.50.215.163 identifies itself as "chatglm-spider" through its user-agent string, Mozilla/5.0 (compatible; ChatGLM-Spider/1.0; +https://chatglm.cn/), and requests publicly available pages such as blog posts, category pages, sitemaps, and contact pages. This kind of systematic access is typical of web crawlers that collect content for search, AI, or data analysis services. If chatglm-spider appears in your logs, it is most likely gathering publicly accessible content for one of those purposes.

What does chatglm-spider do?

chatglm-spider appears to be an automated web crawler that accesses various pages on websites, as indicated by its user agent and visit patterns. Its requests target a range of URLs, including blog articles, sitemaps, and category pages, which is consistent with indexing or data collection activities. The user agent references https://chatglm.cn/, suggesting an association with the ChatGLM project. To verify its identity, check the user agent in your server logs and perform a reverse DNS lookup on the IP address. For management, consider updating your robots.txt file, monitoring access frequency, and using allow/block lists or web application firewall (WAF) rules as needed.
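
The two verification steps mentioned above can be sketched in Python as follows. This is a minimal example under stated assumptions: the helper names `identity_link` and `forward_confirmed_rdns` are hypothetical, the check is IPv4-only, and no official hostname suffix for this crawler is assumed. Because no hostname was observed for this IP, a `None` result from the DNS check would not be conclusive on its own.

```python
import re
import socket

# Hypothetical pattern for the "+https://..." identity link many crawlers embed.
IDENTITY_LINK = re.compile(r"\+(https?://[^\s;)]+)")


def identity_link(user_agent):
    """Return the identity URL embedded in a user-agent string, or None."""
    match = IDENTITY_LINK.search(user_agent)
    return match.group(1) if match else None


def forward_confirmed_rdns(ip, reverse=socket.gethostbyaddr,
                           forward=socket.gethostbyname_ex):
    """Return the PTR hostname if it resolves back to `ip`, otherwise None.

    The resolver functions are parameters so they can be replaced in tests.
    """
    try:
        hostname = reverse(ip)[0]        # reverse (PTR) lookup
        addresses = forward(hostname)[2]  # forward (A record) lookup
    except OSError:                       # covers socket.herror and socket.gaierror
        return None
    return hostname if ip in addresses else None


UA = "Mozilla/5.0 (compatible; ChatGLM-Spider/1.0; +https://chatglm.cn/)"
print(identity_link(UA))
# A live lookup would be: forward_confirmed_rdns("117.50.215.163")
```

A user-agent string is trivial to spoof, so the identity link only shows what the client claims; the forward-confirmed lookup is the part that ties a request to a network the operator controls.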

Is chatglm-spider safe?

chatglm-spider identifies itself transparently and follows typical crawling patterns, but its safety cannot be fully guaranteed without further verification. There is no evidence from the provided data of malicious activity, but as with any automated bot, it is important to monitor its behavior. To ensure safety:

  1) Check the reverse DNS of 117.50.215.163 to confirm its origin.
  2) Review server logs for unusual or excessive requests.
  3) Use robots.txt to control crawler access.
  4) Apply rate limiting or WAF rules if necessary.
  5) Add the IP to allow or block lists based on your site's policy.

Regular monitoring is recommended.
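
For the robots.txt step, the Python sketch below tests a candidate rule set with the standard library's `urllib.robotparser` before deploying it. The rules, the helper name `is_allowed`, and the example.com URLs are illustrative assumptions. Such rules only take effect if the crawler actually fetches and honors robots.txt, and no /robots.txt request was observed from this IP.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rule set: block ChatGLM-Spider everywhere, allow everyone else.
ROBOTS_TXT = """\
User-agent: ChatGLM-Spider
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt, agent_token, url):
    """Return True if `agent_token` may fetch `url` under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent_token, url)


print(is_allowed(ROBOTS_TXT, "ChatGLM-Spider", "https://example.com/blog/"))
print(is_allowed(ROBOTS_TXT, "SomeOtherBot", "https://example.com/blog/"))
```

The check passes the product token rather than the full user-agent string, because `urllib.robotparser` matches against the text before the first slash, which for the full string would be "Mozilla".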

This page provides a detailed lookup for the bot with the IP address of 117.50.215.163, based on real-world traffic observed across websites using Track-A-Bot software. Use this information to verify whether activity from this IP belongs to a legitimate search engine crawler, or to analyze bot behavior on your website.