News

Artificial intelligence is painting pictures, writing novels, making videos, and composing symphonies. Can it change what we ...
Abstract: Remote sensing image classification plays a crucial ... In this paper, a new hybrid cross-activation network (HC-Mamba) is proposed based on the Visual State Space (VSS) model represented by ...
For scientific reasoning, it achieved 69.3 points on the GPQA-diamond test. Tencent says the model particularly excels in math. It achieved 96.2 points on the MATH-500 benchmark, placing just behind ...
Abstract: Current visual captioning technologies typically ... To address this issue, we propose STPos-VC, a pre-trained vision-language model that maps visual information from the visual vector space ...
A generalizable Hi-C foundation model for chromatin architecture, single-cell and multi-omics analysis across species. bioRxiv, 2024. Paper @article{wang2024hicfoundation, title={A generalizable Hi-C ...
The ACMM is a framework that defines five levels of architecture capability and maturity, from Level 1 (Initial) to Level 5 (Optimizing). Each level has a set of attributes, processes, and ...
Ovis (Open VISion) is a novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. For a comprehensive introduction, please refer to the ...