GitHub - woodongk/story-blender: 🤖 ➡︎ 🧑🏻‍🦰 Human-Like Story Generation from Caption Using Seq2Seq Model

seq2seq 모델을 적용시킨 스토리 문장 생성 연구

︎Human-Like Story Generation from Caption Using Seq2Seq Model ( 🤖 machine-like ➡️ 🧑 human-like )
2018년도 아주대학교 미디어학과 졸업 프로젝트 최우수상 수상

"the fireworks are shooting off in the sky" -> [Seq2Seq model] -> "the fireworks were beautiful"

Sequence-to-Sequence (Seq2Seq) 모델은 주로 한 도메인인(예: 한국어 문장)에서 다른 도메인(예: 영어로 번역된 동일한 문장)의 sequence로 sequence를 변환하기 위한 모델을 말한다.
"기계가 생성한 딱딱한 문장을 인간이 쓴 듯한 언어로 변형하면 어떨까?"라는 단순한 생각에서 시작하게 된 프로젝트

본 프로젝트를 위한 데이터로 마이크로소프트 사에서 제공하는 VIST(Visual Storytelling Dataset)을 사용함
VIST는 주로 image captioning task에 쓰이는 데이터셋으로, 특정 이벤트로 묶인 순차적인 이미지들을 각각 캡션 문장(descriptions for images in isolation, DII)과 순차적인 스토리 문장(stories for images in sequence, SIS)의 쌍으로 제공
image captioning task에 쓰이는 기술은 현 시점에서 매우 발전되어 있기에 데이터셋 또한 쉽게 구할 수 있었음
[Code]

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
dataset		dataset
generation_results		generation_results
imgs		imgs
model_output		model_output
reference		reference
.gitignore		.gitignore
1. VIST_data preprocessing.ipynb		1. VIST_data preprocessing.ipynb
2. Seq2Seq - keras.ipynb		2. Seq2Seq - keras.ipynb
2. Seq2Seq with attention - tensorflow.ipynb		2. Seq2Seq with attention - tensorflow.ipynb
README.md		README.md
Seq2Seq_BILSTM_1124_learning30000_emb_size_150-Copy1.ipynb		Seq2Seq_BILSTM_1124_learning30000_emb_size_150-Copy1.ipynb