AI & Bot Access Documentation

Comprehensive guide for AI systems, research bots, and automated tools to access and consume content from Lyóvson.com.

Content Access Methods

1. RSS/JSON/Atom Feeds (Recommended)

For bulk content consumption, use our syndication feeds. They include full article content, metadata, and are updated hourly.

JSON Feed with Enhanced Metadata

GET https://lyovson.com/feed.json

// Returns JSON with full content + AI-friendly metadata:
{
  "items": [{
    "title": "Article Title",
    "content_text": "Full article content...",
    "_lyovson_metadata": {
      "wordCount": 1200,
      "readingTime": 6,
      "contentType": "article",
      "language": "en",
      "projectSlug": "next",
      "apiUrl": "https://lyovson.com/api/posts/123"
    }
  }]
}

2. GraphQL API

For structured queries and real-time data access. Supports filtering, sorting, and relationship traversal.

GraphQL Query Example

POST https://lyovson.com/api/graphql

query LatestPosts {
  Posts(limit: 10, sort: "-publishedAt", where: { _status: { equals: "published" } }) {
    docs {
      title
      slug
      content
      publishedAt
      populatedAuthors {
        name
        username
      }
      project {
        name
        slug
      }
      topics {
        name
        slug
      }
      meta {
        title
        description
      }
    }
  }
}

3. REST API

Standard REST endpoints for all content types. Supports pagination, filtering, and depth control.

REST API Examples

# Get latest posts
GET https://lyovson.com/api/posts?limit=10&sort=-publishedAt&where[_status][equals]=published

# Get all projects with related posts
GET https://lyovson.com/api/projects?depth=1

# Search content
GET https://lyovson.com/api/search?q=programming&limit=20

# Get specific post with full depth
GET https://lyovson.com/api/posts/[id]?depth=2

4. Vector Embeddings API

Get vector embeddings for semantic search, content similarity, and AI applications. Supports both OpenAI embeddings and fallback hash-based vectors.

Embeddings API Examples

# Collection-specific endpoints (pre-computed, ~50ms)
GET https://lyovson.com/api/embeddings/posts/123      # Articles & blog posts
GET https://lyovson.com/api/embeddings/books/456      # Books with quotes
GET https://lyovson.com/api/embeddings/notes/789      # Personal notes

# Bulk access for training/analysis
GET https://lyovson.com/api/embeddings?type=posts&limit=50

# Real-time query embedding
GET https://lyovson.com/api/embeddings?q=programming tutorials

# System health across all collections
GET https://lyovson.com/api/embeddings/status

# Advanced options
GET https://lyovson.com/api/embeddings/posts/123?content=true&format=full
GET https://lyovson.com/api/embeddings/books/456?regenerate=true

# Response structure:
{
  "id": 123,
  "embedding": [0.1, -0.2, 0.3, ...], // 1536-dimensional vector
  "dimensions": 1536,
  "metadata": {
    "type": "post", // or "book", "note"
    "title": "Post Title",
    "url": "https://lyovson.com/project/post-slug",
    "wordCount": 1200,
    "readingTime": 6,
    "topics": ["programming", "javascript"]
  },
  "model": "text-embedding-3-small"
}

🧠 Advanced Vector Embeddings System

⚡ High-Performance Pre-computed Embeddings

Our embedding system uses pgvector + OpenAI's text-embedding-3-small model with collection-specific endpoints and automatic pre-computation for lightning-fast API responses (<100ms vs 1-3s traditional).

🚀 Performance Features

  • pgvector storage - 44% smaller than JSONB
  • HNSW indexes - Sub-millisecond similarity search
  • Collection-specific - Posts, Books, Notes endpoints
  • 1536-dimensional OpenAI text-embedding-3-small
  • Smart regeneration - Only when content changes
  • Fallback system - Works without OpenAI API key
  • Sub-100ms responses for individual items
  • Bulk access for training and analysis

🔧 AI Applications

  • Semantic search - Find related content
  • Content clustering - Group similar articles
  • Recommendation engines - Suggest related posts
  • Content analysis - Theme and topic discovery
  • Similarity scoring - Measure content relationships
  • AI training data - High-quality labeled vectors

📊 Monitor System Health

Check embedding coverage and system status:

GET https://lyovson.com/api/embeddings/status

Best Practices for AI Systems

🚀 Performance

  • Use feeds for bulk content access (rate limit: 1000/hour)
  • Respect Cache-Control headers for optimal performance
  • API endpoints have lower rate limits (100/hour)
  • Include descriptive User-Agent header identifying your service

📝 Content Understanding

  • All content includes structured metadata (JSON-LD)
  • Articles are categorized by project and tagged with topics
  • Full-text search available across all content
  • Content relationships are explicit (author, project, topics)

🤝 Attribution

  • Content copyright: Rafa & Jess Lyóvson
  • Attribution required: "Lyóvson.com - https://www.lyovson.com"
  • Contact hello@lyovson.com for licensing questions
  • Academic and research use generally permitted with attribution

Structured Data & Metadata

All pages include comprehensive structured data following Schema.org standards:

Schema Types

  • 📄 Article (posts)
  • 🏢 Organization (site info)
  • 🌐 WebSite (global metadata)
  • 👤 Person (authors)
  • 🔍 SearchAction (search capability)

Metadata Fields

  • 📅 Publication/modification dates
  • 📖 Word count & reading time
  • 🏷️ Topics and categories
  • 👥 Author information
  • 🔗 Canonical URLs

Article Schema includes:

  • Context and type information
  • Headline and description
  • Publication and modification dates
  • Author information with URLs
  • Publisher organization data
  • Word count and reading time

Contact & Support

Need higher rate limits, custom access, or have questions about using our content?

📧 Email: hello@lyovson.com

🐛 Issues: GitHub

📱 Twitter: @lyovson

Last updated: January 16, 2025 • Machine-readable version