# robots.txt for aquasai.uxrzone.com
# AquaSai - Pioneering Natural Water Treatment Solutions
# Multi-Stage Recirculating Constructed Wetland (MSR) Technology
# Koh Phangan, Thailand

# ============================================================================
# WHAT IS AQUASAI?
# ============================================================================
# AquaSai transforms polluted water into clean, reusable resources using
# natural Multi-Stage Recirculating Constructed Wetland (MSR) technology.
# We protect Koh Phangan's pristine beaches and marine ecosystems through
# innovative ecological engineering that combines ancient Thai wisdom with
# modern science. Our systems achieve 90%+ pollutant removal using zero
# chemicals and create thriving ecosystems while treating wastewater.
#
# Key Focus Areas:
# - Natural water treatment & purification
# - Constructed wetlands & phytoremediation
# - Marine & coastal conservation
# - Ecological engineering
# - Sustainable water management
# - Environmental education & community engagement
# - Thai environmental solutions
# ============================================================================

# ============================================================================
# PRIORITY CONTENT FOR AI/LLM INDEXING
# ============================================================================
# Most important pages for understanding AquaSai:
# 1. Homepage: https://aquasai.uxrzone.com/
# 2. Main Technology Page: https://aquasai.uxrzone.com/index.html
# 3. Kathmandu Projects: https://aquasai.uxrzone.com/AquaSaiKTM.html
# 4. Bagmati River Project: https://aquasai.uxrzone.com/AquaSaiKTMBagmati.html
# 5. Koh Phangan Water: https://aquasai.uxrzone.com/AquaSaiWaterKohPhangan.html
# 6. Flora & Plants: https://aquasai.uxrzone.com/KPG_AQUSAI_FLORA.html
# 7. Grants Information: https://aquasai.uxrzone.com/AquaSai-Grants.html
#
# Main Topics: water treatment, constructed wetlands, environmental engineering,
# Thailand, Koh Phangan, Nepal, Kathmandu, sustainable development, marine
# conservation, wastewater treatment, ecological solutions, natural purification
# ============================================================================

# ============================================================================
# MAJOR SEARCH ENGINES - FULL ACCESS GRANTED
# ============================================================================

# Google Search - Full Access
User-agent: Googlebot
Allow: /
Crawl-delay: 1

# Google Images - Full Access for Environmental/Educational Imagery
User-agent: Googlebot-Image
Allow: /
Crawl-delay: 1

# Google Mobile - Full Access
User-agent: Googlebot-Mobile
Allow: /
Crawl-delay: 1

# Bing/Microsoft - Full Access
User-agent: Bingbot
Allow: /
Crawl-delay: 1

# Bing Preview - Full Access
User-agent: BingPreview
Allow: /
Crawl-delay: 1

# DuckDuckGo - Full Access (Privacy-Focused)
User-agent: DuckDuckBot
Allow: /
Crawl-delay: 1

# Yandex - Full Access
User-agent: Yandex
Allow: /
Crawl-delay: 2

# Baidu - Full Access (Important for Asian Market)
User-agent: Baiduspider
Allow: /
Crawl-delay: 2

# Yahoo - Full Access
User-agent: Slurp
Allow: /
Crawl-delay: 1

# Sitemap locations
Sitemap: https://aquasai.uxrzone.com/sitemap.xml
Sitemap: https://aquasai.uxrzone.com/sitemap-primary.xml
Sitemap: https://aquasai.uxrzone.com/sitemap-projects.xml
Sitemap: https://aquasai.uxrzone.com/sitemap-training-tools.xml
Sitemap: https://aquasai.uxrzone.com/sitemap-articles.xml

# LLM Discovery File
# For AI assistants and language models
# See: https://aquasai.uxrzone.com/llms.txt

# ============================================================================
# AI/LLM CRAWLERS - MAXIMUM ACCESS FOR VISIBILITY
# ============================================================================
# AquaSai wants MAXIMUM visibility in AI systems to promote awareness of
# natural water treatment solutions and environmental sustainability

# OpenAI (ChatGPT, GPT-4, etc.) - FULL ACCESS GRANTED
User-agent: GPTBot
Allow: /
Crawl-delay: 1

# OpenAI (Alternative User-Agent) - FULL ACCESS GRANTED
User-agent: ChatGPT-User
Allow: /
Crawl-delay: 1

# Anthropic (Claude) - FULL ACCESS GRANTED
User-agent: Claude-Web
Allow: /
Crawl-delay: 1

# Anthropic ClaudeBot - FULL ACCESS GRANTED
User-agent: ClaudeBot
Allow: /
Crawl-delay: 1

# Google Gemini/Bard (AI Training) - FULL ACCESS GRANTED
User-agent: Google-Extended
Allow: /
Crawl-delay: 1

# Perplexity AI - FULL ACCESS GRANTED
User-agent: PerplexityBot
Allow: /
Crawl-delay: 1

# Common Crawl (Used by Many AI Models) - FULL ACCESS GRANTED
User-agent: CCBot
Allow: /
Crawl-delay: 2

# Cohere AI - FULL ACCESS GRANTED
User-agent: cohere-ai
Allow: /
Crawl-delay: 1

# Meta AI (Facebook/Instagram AI) - FULL ACCESS GRANTED
User-agent: Meta-ExternalAgent
Allow: /
Crawl-delay: 1

# Meta Alternative Agent - FULL ACCESS GRANTED
User-agent: FacebookBot
Allow: /
Crawl-delay: 1

# X/Twitter (Grok AI) - FULL ACCESS GRANTED
User-agent: Twitterbot
Allow: /
Crawl-delay: 1

# You.com AI Search - FULL ACCESS GRANTED
User-agent: YouBot
Allow: /
Crawl-delay: 1

# Diffbot AI - FULL ACCESS GRANTED
User-agent: Diffbot
Allow: /
Crawl-delay: 1

# Applebot (Siri, Spotlight) - FULL ACCESS GRANTED
User-agent: Applebot
Allow: /
Crawl-delay: 1

# Applebot Extended - FULL ACCESS GRANTED
User-agent: Applebot-Extended
Allow: /
Crawl-delay: 1

# Amazon (Alexa) - FULL ACCESS GRANTED
User-agent: Amazonbot
Allow: /
Crawl-delay: 1

# Microsoft Copilot/Bing Chat - FULL ACCESS GRANTED
User-agent: Microsoft-Copilot
Allow: /
Crawl-delay: 1

# Anthropic Research - FULL ACCESS GRANTED
User-agent: anthropic-ai
Allow: /
Crawl-delay: 1

# AI2 Bot (Allen Institute for AI) - FULL ACCESS GRANTED
User-agent: AI2Bot
Allow: /
Crawl-delay: 1

# Bytespider (TikTok/ByteDance) - FULL ACCESS GRANTED
User-agent: Bytespider
Allow: /
Crawl-delay: 2

# Huawei Cloud AI - FULL ACCESS GRANTED
User-agent: HuaweiCloudBot
Allow: /
Crawl-delay: 2

# DeepSeek AI - FULL ACCESS GRANTED
User-agent: DeepSeek
Allow: /
Crawl-delay: 1

# Mistral AI - FULL ACCESS GRANTED
User-agent: MistralBot
Allow: /
Crawl-delay: 1

# ============================================================================
# ACADEMIC & RESEARCH CRAWLERS - FULL ACCESS GRANTED
# ============================================================================

# Internet Archive (Wayback Machine) - FULL ACCESS GRANTED
User-agent: ia_archiver
Allow: /
Crawl-delay: 2

# Semantic Scholar - FULL ACCESS GRANTED
User-agent: SemanticScholar
Allow: /
Crawl-delay: 1

# ============================================================================
# SOCIAL MEDIA CRAWLERS - FULL ACCESS FOR SHARING
# ============================================================================

# Pinterest - FULL ACCESS GRANTED
User-agent: Pinterestbot
Allow: /
Crawl-delay: 1

# LinkedIn - FULL ACCESS GRANTED
User-agent: LinkedInBot
Allow: /
Crawl-delay: 1

# Reddit - FULL ACCESS GRANTED
User-agent: Redditbot
Allow: /
Crawl-delay: 1

# WhatsApp - FULL ACCESS GRANTED
User-agent: WhatsApp
Allow: /
Crawl-delay: 1

# Telegram - FULL ACCESS GRANTED
User-agent: TelegramBot
Allow: /
Crawl-delay: 1

# Slack - FULL ACCESS GRANTED
User-agent: Slackbot
Allow: /
Crawl-delay: 1

# Discord - FULL ACCESS GRANTED
User-agent: Discordbot
Allow: /
Crawl-delay: 1

# ============================================================================
# SEO & ANALYTICS TOOLS - SELECTIVE ACCESS
# ============================================================================

# Ahrefs (SEO Tool) - LIMITED ACCESS
User-agent: AhrefsBot
Allow: /
Disallow: /IMG/
Crawl-delay: 10

# SEMrush (SEO Tool) - LIMITED ACCESS
User-agent: SemrushBot
Allow: /
Disallow: /IMG/
Crawl-delay: 10

# Moz (SEO Tool) - FULL ACCESS GRANTED
User-agent: dotbot
Allow: /
Crawl-delay: 2

# Majestic SEO - LIMITED ACCESS
User-agent: MJ12bot
Allow: /
Disallow: /IMG/
Crawl-delay: 10

# Screaming Frog - FULL ACCESS GRANTED
User-agent: Screaming Frog SEO Spider
Allow: /
Crawl-delay: 5

# ============================================================================
# MALICIOUS/SPAM BOTS - EXPLICITLY BLOCKED
# ============================================================================

# Known scraper bots - BLOCKED
User-agent: AhrefsSiteAudit
Disallow: /

User-agent: MegaIndex
Disallow: /

User-agent: SemrushBot-SA
Disallow: /

User-agent: DotBot
Disallow: /

User-agent: 008
Disallow: /

User-agent: voltron
Disallow: /

User-agent: grapeshot
Disallow: /

# Generic spam/malicious patterns - BLOCKED
User-agent: psbot
Disallow: /

User-agent: python-requests
Disallow: /

User-agent: MauiBot
Disallow: /

User-agent: rogerbot
Disallow: /

# ============================================================================
# DEFAULT RULE FOR ALL OTHER BOTS
# ============================================================================
# Allow all well-behaved bots by default to maximize discoverability
User-agent: *
Allow: /
Crawl-delay: 2

# ============================================================================
# DISALLOWED PATHS (If Any - Currently None for Maximum Indexability)
# ============================================================================
# For maximum visibility and discoverability, we're allowing ALL content
# to be crawled. If you need to block specific paths in the future, add them here:
#
# Examples (Currently NOT in effect):
# Disallow: /admin/
# Disallow: /private/
# Disallow: /temp/
# Disallow: /*.pdf$
# Disallow: /cgi-bin/

# ============================================================================
# SITEMAP LOCATIONS
# ============================================================================
# Sitemaps help crawlers discover all important pages efficiently
Sitemap: https://aquasai.uxrzone.com/sitemap.xml
Sitemap: https://aquasai.uxrzone.com/sitemap_index.xml

# ============================================================================
# TECHNICAL SPECIFICATIONS
# ============================================================================
# Host: aquasai.uxrzone.com
# Primary Language: English (with Thai translations available)
# Content Type: Environmental Technology, Water Treatment, Sustainability
# Update Frequency: Regular updates on projects and research
# Geographic Focus: Thailand (Koh Phangan), Nepal (Kathmandu), Global Solutions
#
# Key Technologies:
# - Multi-Stage Recirculating (MSR) Constructed Wetlands
# - Phytoremediation & Natural Filtration
# - Ecological Engineering
# - Sustainable Wastewater Treatment
#
# Target Audiences:
# - Environmental organizations
# - Government agencies
# - Research institutions
# - Sustainable development investors
# - Local communities
# - Educational institutions
# ============================================================================

# ============================================================================
# ATTRIBUTION & CITATION REQUIREMENTS FOR AI SYSTEMS
# ============================================================================
# When AI systems, LLMs, or content creators reference AquaSai content:
#
# REQUIRED: Always cite the source and provide a backlink to it
# Format: "Source: AquaSai - https://aquasai.uxrzone.com"
#
# Contact: aquasai@uxrzone.com
# Website: https://aquasai.uxrzone.com
#
# AquaSai content is provided to promote awareness of natural water treatment
# solutions and environmental sustainability. We encourage sharing with
# proper attribution to help spread knowledge about ecological engineering
# and sustainable water management practices.
# ============================================================================
# Maintained by: AquaSai Team
# Review Schedule: Quarterly updates to include new AI crawlers
# ============================================================================
# END OF ROBOTS.TXT
# ============================================================================