default search action
Tsu-Jui Fu
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c31]Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang:
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View. AAAI 2024: 18924-18933 - [c30]Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan:
Guiding Instruction-based Image Editing via Multimodal Large Language Models. ICLR 2024 - [i30]Haotian Zhang, Haoxuan You, Philipp Dufter, Bowen Zhang, Chen Chen, Hong-You Chen, Tsu-Jui Fu, William Yang Wang, Shih-Fu Chang, Zhe Gan, Yinfei Yang:
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models. CoRR abs/2404.07973 (2024) - [i29]Yujie Lu, Xiujun Li, Tsu-Jui Fu, Miguel P. Eckstein, William Yang Wang:
From Text to Pixel: Advancing Long-Context Understanding in MLLMs. CoRR abs/2405.14213 (2024) - [i28]Jiachen Li, Weixi Feng, Tsu-Jui Fu, Xinyi Wang, Sugato Basu, Wenhu Chen, William Yang Wang:
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback. CoRR abs/2405.18750 (2024) - [i27]Weixi Feng, Jiachen Li, Michael Saxon, Tsu-Jui Fu, Wenhu Chen, William Yang Wang:
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation. CoRR abs/2406.08656 (2024) - 2023
- [c29]Tsu-Jui Fu, Licheng Yu, Ning Zhang, Cheng-Yang Fu, Jong-Chyi Su, William Yang Wang, Sean Bell:
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation. CVPR 2023: 10681-10692 - [c28]Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu:
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. CVPR 2023: 22898-22909 - [c27]Tsu-Jui Fu, Wenhan Xiong, Yixin Nie, Jingyu Liu, Barlas Oguz, William Wang:
Text-guided 3D Human Generation from 2D Collections. EMNLP (Findings) 2023: 4508-4520 - [c26]Siqi Liu, Weixi Feng, Tsu-Jui Fu, Wenhu Chen, William Wang:
EDIS: Entity-Driven Image Search over Multimodal Web Content. EMNLP 2023: 4877-4894 - [c25]Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Wang, Miguel P. Eckstein, William Wang:
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation. EMNLP 2023: 11113-11122 - [c24]Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang:
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis. ICLR 2023 - [c23]Weixi Feng, Wanrong Zhu, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang:
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models. NeurIPS 2023 - [c22]Jing Gu, Yilin Wang, Nanxuan Zhao, Tsu-Jui Fu, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, Hyunjoon Jung, Xin Eric Wang:
PHOTOSWAP: Personalized Subject Swapping in Images. NeurIPS 2023 - [i26]Xuehai He, Weixi Feng, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang:
Discriminative Diffusion Models as Few-shot Vision and Language Learners. CoRR abs/2305.10722 (2023) - [i25]Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang:
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation. CoRR abs/2305.11317 (2023) - [i24]Tsu-Jui Fu, Wenhan Xiong, Yixin Nie, Jingyu Liu, Barlas Oguz, William Yang Wang:
Text-guided 3D Human Generation from 2D Collections. CoRR abs/2305.14312 (2023) - [i23]Weixi Feng, Wanrong Zhu, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang:
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models. CoRR abs/2305.15393 (2023) - [i22]Jing Gu, Yilin Wang, Nanxuan Zhao, Tsu-Jui Fu, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, Hyunjoon Jung, Xin Eric Wang:
Photoswap: Personalized Subject Swapping in Images. CoRR abs/2305.18286 (2023) - [i21]Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang:
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View. CoRR abs/2307.06082 (2023) - [i20]Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan:
Guiding Instruction-based Image Editing via Multimodal Large Language Models. CoRR abs/2309.17102 (2023) - 2022
- [c21]Tsu-Jui Fu, William Yang Wang, Daniel McDuff, Yale Song:
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents. AAAI 2022: 634-642 - [c20]Tsu-Jui Fu, Xin Eric Wang, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang:
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers. CVPR 2022: 10503-10512 - [c19]Tsu-Jui Fu, Xin Eric Wang, William Yang Wang:
Language-Driven Artistic Style Transfer. ECCV (36) 2022: 717-734 - [c18]Xuehai He, Diji Yang, Weixi Feng, Tsu-Jui Fu, Arjun R. Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Wang:
CPL: Counterfactual Prompt Learning for Vision and Language Models. EMNLP 2022: 3407-3418 - [c17]Weixi Feng, Tsu-Jui Fu, Yujie Lu, William Yang Wang:
ULN: Towards Underspecified Vision-and-Language Navigation. EMNLP 2022: 6394-6412 - [i19]Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu:
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. CoRR abs/2209.01540 (2022) - [i18]Weixi Feng, Tsu-Jui Fu, Yujie Lu, William Yang Wang:
ULN: Towards Underspecified Vision-and-Language Navigation. CoRR abs/2210.10020 (2022) - [i17]Xuehai He, Diji Yang, Weixi Feng, Tsu-Jui Fu, Arjun R. Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang:
CPL: Counterfactual Prompt Learning for Vision and Language Models. CoRR abs/2210.10362 (2022) - [i16]Tsu-Jui Fu, Licheng Yu, Ning Zhang, Cheng-Yang Fu, Jong-Chyi Su, William Yang Wang, Sean Bell:
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation. CoRR abs/2211.12824 (2022) - [i15]Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang:
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis. CoRR abs/2212.05032 (2022) - 2021
- [c16]Jhih-wei Chen, Tsu-Jui Fu, Chen-Kang Lee, Wei-Yun Ma:
H-FND: Hierarchical False-Negative Denoising for Distant Supervision Relation Extraction. ACL/IJCNLP (Findings) 2021: 2579-2593 - [c15]Wanrong Zhu, Xin Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang:
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation. EACL 2021: 1207-1221 - [c14]An Yan, Xin Wang, Tsu-Jui Fu, William Yang Wang:
L2C: Describing Visual Differences Needs Semantic Understanding of Individuals. EACL 2021: 2315-2320 - [c13]Tsu-Jui Fu, William Yang Wang:
Semi-Supervised Policy Initialization for Playing Games with Language Hints. NAACL-HLT 2021: 3112-3116 - [i14]Tsu-Jui Fu, William Yang Wang, Daniel McDuff, Yale Song:
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents. CoRR abs/2101.11796 (2021) - [i13]An Yan, Xin Eric Wang, Tsu-Jui Fu, William Yang Wang:
L2C: Describing Visual Differences Needs Semantic Understanding of Individuals. CoRR abs/2102.01860 (2021) - [i12]Tsu-Jui Fu, Xin Eric Wang, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang:
Language-based Video Editing via Multi-Modal Multi-Level Transformer. CoRR abs/2104.01122 (2021) - [i11]Tsu-Jui Fu, Xin Eric Wang, William Yang Wang:
Language-Driven Image Style Transfer. CoRR abs/2106.00178 (2021) - [i10]Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu:
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling. CoRR abs/2111.12681 (2021) - 2020
- [c12]Peng-Hsuan Li, Tsu-Jui Fu, Wei-Yun Ma:
Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER. AAAI 2020: 8236-8244 - [c11]Tsu-Jui Fu, Xin Eric Wang, Matthew F. Peterson, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang:
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler. ECCV (6) 2020: 71-86 - [c10]Tsu-Jui Fu, Xin Wang, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang:
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning. EMNLP (1) 2020: 4413-4422 - [i9]Wanrong Zhu, Xin Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang:
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation. CoRR abs/2007.00229 (2020) - [i8]Tsu-Jui Fu, Xin Eric Wang, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang:
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning. CoRR abs/2009.09566 (2020) - [i7]Jhih-Wei Chen, Tsu-Jui Fu, Chen-Kang Lee, Wei-Yun Ma:
H-FND: Hierarchical False-Negative Denoising for Distant Supervision Relation Extraction. CoRR abs/2012.03536 (2020)
2010 – 2019
- 2019
- [c9]Tsu-Jui Fu, Peng-Hsuan Li, Wei-Yun Ma:
GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction. ACL (1) 2019: 1409-1418 - [c8]Zhang-Wei Hong, Tsu-Jui Fu, Tzu-Yun Shann, Chun-Yi Lee:
Adversarial Active Exploration for Inverse Dynamics Model Learning. CoRL 2019: 552-565 - [c7]Hsuan-Kung Yang, Tsu-Jui Fu, Po-Han Chiang, Kuan-Wei Ho, Chun-Yi Lee:
A Distributed Scheme for Accelerating Semantic Video Segmentation on An Embedded Cluster. ICCD 2019: 73-81 - [c6]Tsu-Jui Fu, Shao-Heng Tai, Hwann-Tzong Chen:
Attentive and Adversarial Learning for Video Summarization. WACV 2019: 1579-1587 - [i6]Tsu-Jui Fu, Yuta Tsuboi, Sosuke Kobayashi, Yuta Kikuchi:
Learning from Observation-Only Demonstration for Task-Oriented Language Grounding via Self-Examination. ViGIL@NeurIPS 2019 - [i5]Peng-Hsuan Li, Tsu-Jui Fu, Wei-Yun Ma:
Remedying BiLSTM-CNN Deficiency in Modeling Cross-Context for NER. CoRR abs/1908.11046 (2019) - [i4]Tsu-Jui Fu, Xin Wang, Matthew F. Peterson, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang:
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling. CoRR abs/1911.07308 (2019) - 2018
- [c5]Kang-Jun Liu, Tsu-Jui Fu, Shan-Hung Wu:
Region-Semantics Preserving Image Synthesis. ACCV (4) 2018: 322-337 - [c4]Yu-Syuan Xu, Tsu-Jui Fu, Hsuan-Kung Yang, Chun-Yi Lee:
Dynamic Video Segmentation Network. CVPR 2018: 6556-6565 - [c3]Hsuan-Kung Yang, An-Chieh Cheng, Kuan-Wei Ho, Tsu-Jui Fu, Chun-Yi Lee:
Visual Relationship Prediction via Label Clustering and Incorporation of Depth Information. ECCV Workshops (2) 2018: 571-581 - [c2]Tsu-Jui Fu, Wei-Yun Ma:
Speed Reading: Learning to Read ForBackward via Shuttle. EMNLP 2018: 4439-4448 - [c1]Zhang-Wei Hong, Tzu-Yun Shann, Shih-Yang Su, Yi-Hsiang Chang, Tsu-Jui Fu, Chun-Yi Lee:
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning. NeurIPS 2018: 10510-10521 - [i3]Yu-Syuan Xu, Tsu-Jui Fu, Hsuan-Kung Yang, Chun-Yi Lee:
Dynamic Video Segmentation Network. CoRR abs/1804.00931 (2018) - [i2]Zhang-Wei Hong, Tsu-Jui Fu, Tzu-Yun Shann, Yi-Hsiang Chang, Chun-Yi Lee:
Adversarial Exploration Strategy for Self-Supervised Imitation Learning. CoRR abs/1806.10019 (2018) - [i1]Hsuan-Kung Yang, An-Chieh Cheng, Kuan-Wei Ho, Tsu-Jui Fu, Chun-Yi Lee:
Visual Relationship Prediction via Label Clustering and Incorporation of Depth Information. CoRR abs/1809.02945 (2018)
Coauthor Index
aka: Xin Eric Wang
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-19 20:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint