Papers - Microsoft
Can large language models explore in-context?
Paper
• 2403.15371
• Published
• 33
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Paper
• 2403.19655
• Published
• 19
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Paper
• 2404.00656
• Published
• 11
Enabling Memory Safety of C Programs using LLMs
Paper
• 2404.01096
• Published
• 1
LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models
Paper
• 2404.01617
• Published
• 8
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Paper
• 2204.08387
• Published
• 8
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Paper
• 2012.14740
• Published
• 3
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Paper
• 1912.13318
• Published
• 5
PIQA: Reasoning about Physical Commonsense in Natural Language
Paper
• 1911.11641
• Published
• 5
Are NLP Models really able to Solve Simple Math Word Problems?
Paper
• 2103.07191
• Published
• 1
Learning From Mistakes Makes LLM Better Reasoner
Paper
• 2310.20689
• Published
• 29
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Paper
• 2306.02707
• Published
• 49
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Paper
• 2109.10282
• Published
• 12
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis
Paper
• 2404.03204
• Published
• 10
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Paper
• 2404.03118
• Published
• 25
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper
• 2404.03715
• Published
• 62
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Paper
• 2404.06209
• Published
• 5
Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
Paper
• 2404.03622
• Published
• 5
Rho-1: Not All Tokens Are What You Need
Paper
• 2404.07965
• Published
• 94
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
Paper
• 2404.07738
• Published
• 2
GLIGEN: Open-Set Grounded Text-to-Image Generation
Paper
• 2301.07093
• Published
• 4
Grounded Language-Image Pre-training
Paper
• 2112.03857
• Published
• 3
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper
• 2404.14219
• Published
• 259
Multi-Head Mixture-of-Experts
Paper
• 2404.15045
• Published
• 60
Deep Residual Learning for Image Recognition
Paper
• 1512.03385
• Published
• 12
You Only Cache Once: Decoder-Decoder Architectures for Language Models
Paper
• 2405.05254
• Published
• 10
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Paper
• 2406.08407
• Published
• 28
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper
• 2311.06242
• Published
• 95
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Paper
• 2309.03883
• Published
• 36
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
• 2407.09025
• Published
• 139
Paper
• 2410.05258
• Published
• 180
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs
Paper
• 2410.16144
• Published
• 5
Learning a SAT Solver from Single-Bit Supervision
Paper
• 1802.03685
• Published
• 1
Compiling C to Safe Rust, Formalized
Paper
• 2412.15042
• Published
• 1