Add to Favourites
To login click here

Orchid is a new sequence modeling architecture that integrates a data-dependent convolution mechanism to overcome the limitations of traditional attention-based models. It aims to reduce the computational burden of attention mechanisms while preserving their expressiveness, making it a promising solution for handling long-context tasks in deep learning.