Open Access research papers with codes and datasets.

Pulse

Clubs

Papers

People

Datasets

Codes

PULSE: Our community science stream

ENDORSEMENTApril 26, 2024, 10:22 p.m.

How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning

Despite superior reasoning prowess demonstrated by Large Language Models (LLMs) with Chain-of-Thought (CoT) prompting, a lack of understanding prevails around the internal mechanisms of the …

BOOKMARKApril 24, 2024, 6:27 a.m.

You do not have to train Graph Neural Networks at all on text-attributed graphs

Graph structured data, specifically text-attributed graphs (TAG), effectively represent relationships among varied entities. Such graphs are essential for semi-supervised node classification tasks. Graph Neural Networks …

BOOKMARKApril 23, 2024, 9:40 p.m.

Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry

Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered …

BOOKMARKApril 22, 2024, 10:47 p.m.

LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation

Weakly-Supervised Scene Graph Generation (WSSGG) research has recently emerged as an alternative to the fully-supervised approach that heavily relies on costly annotations. In this regard, …

ENDORSEMENTApril 21, 2024, 9:44 p.m.

To SMOTE, or not to SMOTE?

In imbalanced binary classification problems the objective metric is often non-symmetric and associates a higher penalty with the minority samples. On the other hand, the …

BOOKMARKApril 19, 2024, 1:38 a.m.

Long-form factuality in large language models

Large language models (LLMs) often generate content that contains factual errors when responding to fact-seeking prompts on open-ended topics. To benchmark a model's long-form factuality …

ENDORSEMENTApril 19, 2024, 12:03 a.m.

Neural Spline Flows

A normalizing flow models a complex probability density as an invertible transformation of a simple base density. Flows based on either coupling or autoregressive transforms …

ENDORSEMENTApril 17, 2024, 11:25 p.m.

Differentiable DAG Sampling

We propose a new differentiable probabilistic model over DAGs (DP-DAG). DP-DAG allows fast and differentiable DAG sampling suited to continuous optimization. To this end, DP-DAG …

ENDORSEMENTApril 17, 2024, 1:24 a.m.

Causal Bandits without Graph Learning

We study the causal bandit problem when the causal graph is unknown and develop an efficient algorithm for finding the parent node of the reward …

CLUBApril 16, 2024, 8:32 a.m.

Literatuuronderzoek Klimaatneutrale Industrie

Literatuursessies t.b.v. het programma Klimaatneutrale Industrie.
Kayewords: Energy transition, Industry

BOOKMARKApril 16, 2024, 4:59 a.m.

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

This work introduces an efficient method to scale Transformer-based Large Language Models (LLMs) to infinitely long inputs with bounded memory and computation. A key component …

CODEApril 16, 2024, 2:40 a.m.

JAX

JAX is a Python library for accelerator-oriented array computation and program transformation, designed for high-performance numerical computing and large-scale machine learning.

ENDORSEMENTApril 16, 2024, 2:33 a.m.

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

We introduce RecurrentGemma, an open language model which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on …

BOOKMARKApril 15, 2024, 5:44 a.m.

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to …

ENDORSEMENTApril 15, 2024, 3:18 a.m.

Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes

Detecting and localising unknown or Out-of-distribution (OOD) objects in any scene can be a challenging task in vision. Particularly, in safety-critical cases involving autonomous systems …

ENDORSEMENTApril 14, 2024, 11:06 p.m.

Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

We present an approach for estimating the fraction of text in a large corpus which is likely to be substantially modified or produced by a …

ENDORSEMENTApril 14, 2024, 10:33 p.m.

A Neural Collapse Perspective on Feature Evolution in Graph Neural Networks

Graph neural networks (GNNs) have become increasingly popular for classification tasks on graph-structured data. Yet, the interplay between graph topology and feature evolution in GNNs …

CODEApril 14, 2024, 10:14 a.m.

RAGFlow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. It offers a streamlined RAG workflow for businesses of any scale, combining …

ENDORSEMENTApril 13, 2024, 9:35 p.m.

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Large decoder-only language models (LMs) can be largely improved in terms of perplexity by retrieval (e.g., RETRO), but its impact on text generation quality and …

BOOKMARKApril 13, 2024, 9:52 a.m.

Retrieval Augmentation Reduces Hallucination in Conversation

Despite showing increasingly human-like conversational abilities, state-of-the-art dialogue models often suffer from factual incorrectness and hallucination of knowledge (Roller et al., 2020). In this work …

BOOKMARKApril 13, 2024, 8:01 a.m.

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

Various techniques have been developed in recent years to improve dense retrieval (DR), such as unsupervised contrastive learning and pseudo-query generation. Existing DRs, however, often …

BOOKMARKApril 13, 2024, 7:45 a.m.

ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT

Recent progress in Natural Language Understanding (NLU) is driving fast-paced advances in Information Retrieval (IR), largely owed to fine-tuning deep language models (LMs) for document …

ENDORSEMENTApril 12, 2024, 5:20 a.m.

More Agents Is All You Need

We find that, simply via a sampling-and-voting method, the performance of large language models (LLMs) scales with the number of agents instantiated. Also, this method …

BOOKMARKApril 12, 2024, 5:19 a.m.

Semantically-correlated memories in a dense associative model

I introduce a novel associative memory model named Correlated Dense Associative Memory (CDAM), which integrates both auto- and hetero-association in a unified framework for continuous-valued …

ENDORSEMENTApril 12, 2024, 3:37 a.m.

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

This work introduces an efficient method to scale Transformer-based Large Language Models (LLMs) to infinitely long inputs with bounded memory and computation. A key component …

1
1
2
2
Next

Next

We use cookies to ensure you get the best experience on our website. Learn more