Chainbase: The Largest Omnichain Data Network for AI

Why Does a Decentralized Omnichain Data Network Matter?

Blockchains are developing rapidly, and the emergence of hundreds of chains has scattered on-chain data widely, which makes that data hard to use. On-chain data sources that were already diverse and heterogeneous have become even more complex. Different centralized data companies rebuild the same basic datasets again and again, each with its own pipeline design. As a result, the complexity of further data mining is increasing sharply.

Data scientists have made tremendous efforts to clean data structures and discover the value in data. However, a large amount of resources is wasted, repetitive work continues, and in the end it is difficult to trace results back to the work that produced them, which makes it impossible to judge data reliability. As one of the most important elements of the AI era, data requires efficient value discovery and reliable analysis.

This is why Chainbase has built an Omnichain Data Network, with a testnet launching soon. By incentivizing the community to build it together, we are confident that the network will greatly reduce the complexity and cost of mining on-chain data value. On this basis, the network provides an open, transparent, and permissionless data layer for the AI era. We believe this approach is key to achieving Artificial General Intelligence (AGI).

What is Chainbase?

Chainbase is the world's largest Omnichain Data Network for the upcoming AGI era. Anyone can join and contribute. Our network incentivizes positive behavior for everyone. The underlying infrastructure of the network ensures efficiency, security, and trustlessness while maintaining decentralization.

Our mission is to make data accessible and useful, with the aim of revolutionizing the collaboration between Crypto and AI.

Innovative 4-Layer, Dual-Chain Architecture

Chainbase brings intelligence to blockchain by using an innovative 4-layer framework, based on a novel dual-chain technology architecture and standards. This enables unprecedented programmability and composability of crypto data, supporting high throughput, low latency, and finality. Finally, network security is improved through a dual-staking model.

For the first time, omnichain connectivity will be enabled for anyone. Everyone can create their own Manuscripts on Chainbase's network using SQL or general-purpose languages (GPL), making Chainbase the universal entry point to crypto data.

The innovative 4-layer architecture is as follows:

  • Co-processor Layer: Here, the community promotes the generation of high-quality data under open standards. The unified standards and simple interactive interfaces enable efficient collaboration within the community, thereby achieving large-scale knowledge integration and global collaboration. This is accomplished using user-generated Manuscripts and Theia models.

  • Execution Layer: The next-generation on-chain database, Chainbase DB (CDB), has achieved data parallelism and task parallelism, with efficient processing and storage of large-scale data, resulting in improved overall performance and throughput. The introduction of Eigenlayer provides additional economic security for the whole system and balances the high performance and the high security priorities of the execution layer.

  • Consensus Layer: All execution layer nodes need to reach consensus on the state of large-scale data processing. The efficiency and resilience of consensus are key factors considered. We use CometBFT to ensure that the system can efficiently and robustly reach consensus under a large data load.

  • Data Accessibility Layer: In this layer, on-chain and off-chain data can be integrated trustlessly into our decentralized data lake. Advanced technologies such as zero-knowledge proofs (ZKP) and the storage-based consensus paradigm (SCP) ensure the integrity and reliability of data.

Our innovative 4-layer architecture

Community-driven: How Does the Co-processor Layer Work?

Node operators, developers, data scientists, and common users can easily join the network. The tokenomics of the Chainbase network incentivizes contributions from each role accordingly.

  • Node operators ensure smooth operation of the network by operating Chainbase nodes.

  • Developers and data scientists obtain network data by writing Manuscripts. They may also contribute code to the network for composable uses and benefit from it.

  • Common users interact with Theia using natural language to gain insights into blockchain data in the Chainbase Data Network. They can even build their own task models through prompt engineering and retrieval-augmented generation (RAG).

Anyone can become a community member permissionlessly and find their own role. The co-processor layer, formed by our community, greatly reduces blockchain data mining costs and improves efficiency. At the same time, task models based on Theia enable anyone to build, collaborate on, and own artificial intelligence products, and to profit from them.

Manuscripts for developers

The datasets in the Chainbase network are like a precious metal. Alchemists (developers who build queries) can use Manuscripts to process raw data and extract greater value from it. A Manuscript consists of two main parts:

  • Schema: The definition of the dataset.

  • Operators: The methods for extracting, transforming, and getting greater value from existing data.

These two parts ensure uniformity and are the standard conventions in the network. Their goal is to ensure complete composability between different datasets.
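As a rough illustration of how a schema and a set of operators might fit together, the Python sketch below models a Manuscript as plain data plus functions. The names `Manuscript`, `schema`, `operators`, the two sample operators, and the transfer rows are all hypothetical. This is not the Chainbase API, only a way to picture the two parts working together.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Row = Dict[str, object]
Operator = Callable[[List[Row]], List[Row]]


@dataclass
class Manuscript:
    """Hypothetical model of a Manuscript: a schema plus ordered operators."""
    name: str
    schema: Dict[str, type]  # output column name -> expected Python type
    operators: List[Operator] = field(default_factory=list)

    def validate(self, rows: List[Row]) -> None:
        # Reject output rows that do not match the declared schema.
        for row in rows:
            if set(row) != set(self.schema):
                raise ValueError(f"unexpected columns: {sorted(row)}")
            for column, expected in self.schema.items():
                if not isinstance(row[column], expected):
                    raise TypeError(f"{column} should be {expected.__name__}")

    def run(self, rows: List[Row]) -> List[Row]:
        # Apply each operator in order, then check the result against the schema.
        for op in self.operators:
            rows = op(rows)
        self.validate(rows)
        return rows


def only_large_transfers(rows: List[Row]) -> List[Row]:
    # Extract: keep transfers of at least 100 units.
    return [r for r in rows if r["value"] >= 100]


def total_by_sender(rows: List[Row]) -> List[Row]:
    # Transform: sum the transferred value per sender.
    totals: Dict[str, int] = {}
    for r in rows:
        totals[r["from"]] = totals.get(r["from"], 0) + r["value"]
    return [{"sender": s, "total": t} for s, t in sorted(totals.items())]


# Made-up raw data standing in for an on-chain transfer table.
raw_transfers = [
    {"from": "0xaaa", "to": "0xbbb", "value": 150},
    {"from": "0xaaa", "to": "0xccc", "value": 50},
    {"from": "0xbbb", "to": "0xaaa", "value": 300},
    {"from": "0xaaa", "to": "0xbbb", "value": 200},
]

manuscript = Manuscript(
    name="large_transfer_totals",
    schema={"sender": str, "total": int},
    operators=[only_large_transfers, total_by_sender],
)
result = manuscript.run(raw_transfers)
print(result)
```

Because the output is checked against a declared schema, another Manuscript could take `result` as its input without knowing how it was produced, which is the kind of composability the conventions aim for.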

Data processing using manuscripts

Experienced alchemists can also use a GPL, such as Python or JavaScript, to extract and process data. A GPL provides the most flexible and powerful data extraction capabilities.

Theia models for everyone

Theia is a foundational Crypto World Model. It is derived from model training with the novel D2ORA algorithm and from crypto patterns summarized by machine intelligence, and it provides transparent and reasonable knowledge to users.

It is a new interface for common users, aiming to transform a large amount of on-chain dark knowledge into comprehensive intelligence in a trustworthy manner. For the first time, people can interact with data through natural language.

The Crypto World Model: Theia

We propose the Theia Task Model (TTM) to meet the diverse needs of the crypto world, and we want to open it to our developer community to maximize its value. In brief, developers can use prompt engineering, retrieval-augmented generation (RAG), and real-time on-chain data to provide specialized functions such as trade task models, security task models, and public opinion monitoring task models.
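The combination of retrieval and prompt construction can be pictured with a minimal Python sketch. It uses naive word-overlap scoring in place of a real embedding index, and it stops at building the prompt: no Theia interface is called, because none is described here. The function names and the sample records are hypothetical.

```python
from typing import List


def score(query: str, document: str) -> int:
    """Count distinct query words that also appear in the document."""
    return len(set(query.lower().split()) & set(document.lower().split()))


def retrieve(query: str, documents: List[str], k: int = 2) -> List[str]:
    # Rank documents by overlap with the query and keep the top k.
    ranked = sorted(documents, key=lambda d: score(query, d), reverse=True)
    return ranked[:k]


def build_prompt(query: str, context: List[str]) -> str:
    # Place the retrieved records ahead of the question.
    lines = ["Answer using only the context below.", "Context:"]
    lines += [f"- {c}" for c in context]
    lines += [f"Question: {query}"]
    return "\n".join(lines)


# Made-up records standing in for indexed on-chain data.
documents = [
    "wallet 0xaaa sent 150 usdc to wallet 0xbbb in block 100",
    "contract 0xccc was flagged for a reentrancy bug",
    "wallet 0xaaa received 300 usdc from wallet 0xbbb in block 101",
]
question = "how much usdc did wallet 0xaaa send"
context = retrieve(question, documents)
prompt = build_prompt(question, context)
print(prompt)
```

A task model built this way would differ mainly in what it retrieves (trades, security findings, sentiment data) and in how the prompt is worded.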

Efficiency and Security: Dual Blockchain with Dual Staking

Chainbase introduces an innovative dual-chain architecture of both Eigenlayer and Cosmos that enhances the programmability and composability of cross-chain data, supporting high throughput, low latency, and finality. This architecture achieves higher game-theoretic security than a single-staking model.

We use Eigenlayer AVS to take on the task of the Execution Layer, and its decentralized and parallel environment improves not only efficiency but also economic security. The Consensus Layer uses Cosmos CometBFT, which has been verified for robustness, to achieve instant finality.
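Instant finality in CometBFT-style consensus comes from a voting threshold: a block is committed once validators holding more than two-thirds of the total voting power have precommitted to it. The Python sketch below shows only that threshold check. The validator names and voting powers are made up, and `has_quorum` is a hypothetical helper, not part of the CometBFT API.

```python
from typing import Dict, Set


def has_quorum(voting_power: Dict[str, int], precommits: Set[str]) -> bool:
    """Return True if the precommitting validators hold more than 2/3 of total power."""
    total = sum(voting_power.values())
    signed = sum(power for name, power in voting_power.items() if name in precommits)
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return 3 * signed > 2 * total


# Hypothetical validator set with a total voting power of 100.
validators = {"val-a": 40, "val-b": 30, "val-c": 20, "val-d": 10}

print(has_quorum(validators, {"val-a", "val-b"}))  # 70 of 100 -> True
print(has_quorum(validators, {"val-a", "val-c"}))  # 60 of 100 -> False
```

The comparison is strict, so holding exactly two-thirds of the voting power is not enough to commit a block.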

Chainbase Dual Chain Architecture

Early PoS (Proof of Stake) networks may face the "death spiral" problem. If the value of the token decreases, it will weaken the security of the network, leading to a decrease in Total Value Locked (TVL), further depressing the token price, thus forming a death spiral.

Our innovative dual staking method supports both Liquid Staking Tokens (LST) and our native token, so the network's economic security does not depend on the price of the native token alone.
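A toy calculation shows why backing security with two assets can dampen the spiral. The Python sketch below assumes that economic security is simply the dollar value of all stake and that the LST price does not move with the native token. Both are simplifications, and every amount and price is made up for illustration.

```python
def staked_value(native_tokens: float, native_price: float,
                 lst_tokens: float = 0.0, lst_price: float = 0.0) -> float:
    """Dollar value of the stake securing the network (toy model)."""
    return native_tokens * native_price + lst_tokens * lst_price


def security_drop(before: float, after: float) -> float:
    """Fraction of staked value lost between two points in time."""
    return (before - after) / before


# Single staking: 1,000,000 native tokens, price falls from $1.00 to $0.50.
single_before = staked_value(1_000_000, 1.00)
single_after = staked_value(1_000_000, 0.50)

# Dual staking: the same native stake plus 500 LST at $2,000 each (hypothetical).
dual_before = staked_value(1_000_000, 1.00, 500, 2_000)
dual_after = staked_value(1_000_000, 0.50, 500, 2_000)

print(f"single: {security_drop(single_before, single_after):.0%}")  # single: 50%
print(f"dual:   {security_drop(dual_before, dual_after):.0%}")      # dual:   25%
```

In this example the same fall in the native token price removes half of the staked value under single staking but only a quarter under dual staking, which weakens the feedback loop described above.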

Chainbase Dual Staking Method

Trustless: The Open Data Gateway with Proof

Chainbase, as the world's largest omnichain data network, has the most complete blockchain data, thanks to the Chainbase Data Accessibility Layer, which securely and efficiently stores this data in the Chainbase network.

Chainbase Data Accessibility Layer

On Chainbase's network, there is no need to worry about reorganization issues. The reliability and integrity of the data source are guaranteed by zero-knowledge proofs (ZKP). All events are coordinated by the Chainbase Virtual Machine (CVM), which is a self-developed virtual execution environment that allows for complex data queries.

The original data comes from different data providers, is shared according to specific rules, and is stored in shards as Changelogs on decentralized storage. The entire process is decentralized and immutable, with index data verified using SCP (Storage-based Consensus Paradigm).
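One generic way to make an append-only changelog tamper-evident is to chain its entries by hash, so that changing any past entry invalidates everything after it. The Python sketch below illustrates that idea together with a simple shard assignment. It is not the Chainbase storage format and not an implementation of SCP: the entry layout, the `shard_of` rule (block number modulo shard count), and the sample changes are assumptions made for illustration.

```python
import hashlib
import json
from typing import Dict, List

GENESIS = "0" * 64  # placeholder hash that precedes the first entry


def entry_hash(prev_hash: str, change: Dict) -> str:
    # Hash the previous entry's hash together with the serialized change.
    payload = json.dumps(change, sort_keys=True).encode()
    return hashlib.sha256(prev_hash.encode() + payload).hexdigest()


def append(log: List[Dict], change: Dict) -> None:
    prev = log[-1]["hash"] if log else GENESIS
    log.append({"change": change, "prev": prev, "hash": entry_hash(prev, change)})


def verify(log: List[Dict]) -> bool:
    # Recompute every hash from the start; any edited entry breaks the chain.
    prev = GENESIS
    for entry in log:
        if entry["prev"] != prev or entry["hash"] != entry_hash(prev, entry["change"]):
            return False
        prev = entry["hash"]
    return True


def shard_of(change: Dict, shard_count: int) -> int:
    # Hypothetical sharding rule: assign by block number.
    return change["block"] % shard_count


log: List[Dict] = []
for block in (100, 101, 102):
    append(log, {"block": block, "op": "insert", "table": "transfers"})

intact = verify(log)               # True: the chain is consistent
log[1]["change"]["op"] = "delete"  # tamper with a past entry
tampered = verify(log)             # False: the stored hash no longer matches
print(intact, tampered)
```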

Vision of Chainbase

Experience the revolutionary potential of open blockchains as we pave the way for a truly democratized internet. Our team is dedicated to creating a cutting-edge data network for the AI era.

With Chainbase, there are no barriers to entry - anyone can join and tap into our powerful data infrastructure. And the best part? Everyone has the opportunity to contribute and receive their rightful portion of profits. To achieve this goal, Chainbase Labs has proposed a new roadmap "ZIRCON (Genesis)", marking the beginning of a groundbreaking chapter for Chainbase.

Join us and be a part of the future of data and AGI.

