• 首页
  • 时间轴
  • 分类
  • 标签
  • 搜索
szh's Blog

szh's Blog


嗨 是你啊
标签 Masked Image Modeling
Machine Learning Computer Vision

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

1 研究背景、动机、主要贡献1.1 存在问题(动机)自回归生成由于图像令牌数量庞大,效率低下;而非自回归方法(如MIM)则在性能上有限,无法与先进的扩散模型相比。 1.2 主要贡献 增强的变换器架构:结合多模态和单模态变换器层,提高M...

2024-10-18 Masked Image Modeling, Transformer, 图像生成 阅读全文

szh’s Blog

文章 27 分类 6 标签 13

分类目录

  • Machine Learning22
    • Computer Vision22
  • 大模型安全3
    • 幻觉3
  • 杂1
    • Hexo1

标签合集

Autoregressive Model Benchmark Diffusion Model Diffusion 可控生成 Hexo Masked Image Modeling Transformer VQ inpainting 图像生成 大模型安全-幻觉 视频生成 论文阅读

最新文章

    Taming Scalable Visual Tokenizer for Autoregressive Image Generation Addressing Representation Collapse in Vector Quantized Models with One Linear Layer T2I-CompBench++: An Enhanced and Comprehensive Benchmark for Compositional Text-to-image Generation Blended Latent Diffusion Blended Diffusion for Text-driven Editing of Natural Images
  • © 2025 szh's Blog 版权所有.
  • 本站已运行Loading...
  • Theme Kratos:Rebirth
  • Site built with  by szh.
  • Powered by Hexo
  • Hosted on Github Pages