This repo contains a Chinese-English real & fake news dataset according to existing English fact-checking information. Details on this dataset are described in Dataset Detail. The highlights of our dataset are as follows: Bilingual news pieces for the same event (fact). Multiple Chinese news pieces for the same event … See more The COVID-19 pandemic poses a significant threat to global public health. Meanwhile, there is massive misinformation associated with the pandemic, which advocates unfounded or unscientific claims. … See more Given the current dataset, some future research directions include: 1. The writing style/sentiment/stance differences between fake news and real news. 2. The writing … See more The table below shows the number of annotated news in each language: The metadata of our dataset can be found at CrossFake_metadata.xlsx, … See more Besides the findings and conclusions presented in our paper. We have extra interesting findings during collecting the data: 1. Mixed Fact.For some fake news, their corresponding … See more WebOct 2, 2024 · In this work, we construct a large-scale cleaned Chinese conversation dataset called LCCC, which contains two versions, LCCC-base and LCCC-large. LCCC-base is filtered from 79 million conversations crawled from Weibo, while LCCC-large is filtered from the combination of Weibo data and other sources of Chinese corpora.
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues
WebAt the same time, financial events are filtered from public dataset DuEE to construct dataset DuEE_Fin. As the experimental results show that the proposed Chinese financial event extraction model Roberta-BilSTM-CRF has improved accuracy, recall rate, and F1 score compared with existing models on FinEE and DuEE_Fin datasets. Webis a large-scale news dataset scraped from 38 major news publications, ranging from business to sports. These summaries are often provided by editors and journalists for … pool table rack position
MMED: A Multi-domain and Multi-modality Event Dataset
WebAug 24, 2024 · Misinformation posted on social media during COVID-19 is one main example of infodemic data. This phenomenon was prominent in China when COVID-19 happened at the beginning. While a lot of data can be collected from various social media platforms, publicly available infodemic detection data remains rare and is not easy to … WebThis is the first Chinese news dataset that has both hierarchical topic labels and article full texts. And it is also the largest Chinese news topic dataset. We describe the data … WebHere are 45 Best Chinese News Websites you must follow in 2024. 1. Ecns. Ecns.cn is the official English-language website of China News Service (CNS), providing latest news, … shared ownership aldershot