The dataset covers 82% of the Shi imageries. Paintings with the Shi imageries. This indicates that the paintings in Zikai-Caption are stylistically similar to those in Zikai-Poem. The second auxiliary dataset used to enhance the small Zikai-Poem dataset is TCP-Poem, which provides a large-scale collection of paintings in the traditional Chinese painting style, paired with poems. Every instance in the dataset contains the following metadata: PoemID, PoemText, PoemTitle, PoemDynasty, PoemAuthor, Rationalization, Commentary, and PaintingID. Specifically, we answer the following research questions.

Negative examples in the training data of the image style classification model introduced in Section 2.3.3. We find that about 85% of the paintings are in the Chinese painting style. How do model hyperparameters affect model performance? Image style classification. It is reasonable to assume that the candidate images are semantically relevant to the poem query, because that is the design objective of a search engine. Candidate image retrieval. We use web search engines to retrieve a set of candidate images for each poem, since modern search engines provide effective and efficient image search services and can query large volumes of images. The negative examples are 346 paintings manually labelled as negative from the candidate painting collection.
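The filtering step described above, in which retrieved candidates are kept only if a classifier judges them to be in the target painting style, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `filter_by_style`, the `style_score` callable, and the 0.5 threshold are all hypothetical, and the toy scores stand in for the output of a trained image style classifier.

```python
from typing import Callable, Iterable, List, Tuple


def filter_by_style(
    candidates: Iterable[str],
    style_score: Callable[[str], float],
    threshold: float = 0.5,  # hypothetical cut-off, not taken from the source
) -> Tuple[List[str], float]:
    """Keep candidates whose style score reaches the threshold.

    Returns the kept candidates (in input order) and the fraction kept.
    """
    kept: List[str] = []
    total = 0
    for image in candidates:
        total += 1
        if style_score(image) >= threshold:
            kept.append(image)
    fraction = len(kept) / total if total else 0.0
    return kept, fraction


# Toy scores standing in for a trained classifier (assumed values).
toy_scores = {"a.jpg": 0.9, "b.jpg": 0.2, "c.jpg": 0.7, "d.jpg": 0.4}
kept, fraction = filter_by_style(toy_scores.keys(), toy_scores.__getitem__)
print(kept, fraction)
```

The fraction returned here plays the same role as the share of paintings reported to be in the Chinese painting style, although how that figure was actually measured is not specified in the text.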

Even so, TCP-Poem is still a good collection of classical Chinese paintings, as analyzed above. Poem diversity. The poems in TCP-Poem are selected from different dynasties. The metadata describe, respectively: the poem identification number, the text of the poem, the poem title, the dynasty in which the poem was written, the author of the poem, notes on unusual words, the poem commentary, and the painting identification number. These metadata describe, respectively: the poem identification number, the caption of the painting, and the painting identification number. Then we compute the percentage of poem imageries that are actually portrayed in the corresponding paintings. The paintings are cropped to remove all unnecessary information present on them. Every painting is associated with a poem as its theme, an explanation in modern Chinese, and other commentary texts providing important background information for understanding the poem.
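The imagery coverage statistic mentioned above can be illustrated with a short sketch. It assumes that each poem-painting pair comes with two manually annotated sets, the imageries named in the poem and the imageries visible in the painting, and that the percentage is pooled over all pairs. The text does not say whether the figure is pooled or averaged per poem, so the pooling, the class name `PoemPaintingPair`, and the function name `imagery_coverage` are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable


@dataclass(frozen=True)
class PoemPaintingPair:
    poem_id: str
    poem_imageries: FrozenSet[str]     # imageries mentioned in the poem
    painted_imageries: FrozenSet[str]  # imageries visible in the painting


def imagery_coverage(pairs: Iterable[PoemPaintingPair]) -> float:
    """Percentage of poem imageries that also appear in the paired painting.

    Pooled over all pairs (an assumption); returns 0.0 if no imageries exist.
    """
    total = 0
    portrayed = 0
    for pair in pairs:
        total += len(pair.poem_imageries)
        portrayed += len(pair.poem_imageries & pair.painted_imageries)
    return 100.0 * portrayed / total if total else 0.0


# Made-up annotations, for illustration only.
example_pairs = [
    PoemPaintingPair("p1", frozenset({"moon", "willow", "boat"}),
                     frozenset({"moon", "boat", "bird"})),
    PoemPaintingPair("p2", frozenset({"mountain"}), frozenset({"mountain"})),
]
print(imagery_coverage(example_pairs))  # 3 of 4 poem imageries portrayed -> 75.0
```

Imageries that appear only in the painting (such as "bird" in the first pair) do not affect the result, since the statistic measures how much of the poem is depicted rather than how much of the painting is grounded in the poem.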