User contributions for Vincent
Appearance
2 July 2024
- 11:2811:28, 2 July 2024 diff hist +25 N Translations:Simplification Data/14/nl Created page with "==Synthetische datasets==" current
- 11:2811:28, 2 July 2024 diff hist +491 N Translations:Simplification Data/13/nl Created page with "*[https://github.com/nivack/comparable_corpus_Wablieft_deStandaard Github] *[https://kuleuven.limo.libis.be/discovery/fulldisplay?docid=alma9993153812401488&context=L&vid=32KUL_KUL:KULeuven&lang=en&search_scope=All_Content&adaptor=Local%20Search%20Engine&tab=all_content_tab&query=any,contains,nick%20vanackere&offset=0 Vanackere, N., & Vandeghinste, V. (2022). Building a comparable corpus between easy-to-read Dutch Wablieft and De Standaard. KU Leuven. Faculteit Ingenieur..." current
- 11:2811:28, 2 July 2024 diff hist +477 N Translations:Simplification Data/12/nl Created page with "Corpus gecreëerd door Nick Vanackere. Het bevat 12.687 Wablieft-artikelen uit de periode 2012-2017 en 206.466 De Standaard-artikelen uit de periode 2013-2017. Om de vergelijkbaarheid te garanderen, werden alleen artikels van 08/01/2013 tot 16/11/2017 bekeken, wat resulteerde in 8.744 Wablieft-artikels en 202.284 De Standaard-artikels. Het verschil in het aantal artikelen is te wijten aan de verschijningsfrequentie: Wablieft verschijnt wekelijks en De Standaard dagelijks." current
- 11:2711:27, 2 July 2024 diff hist +46 N Translations:Simplification Data/11/nl Created page with "==Vergelijkbaar Corpus Wablieft De Standaard==" current
- 11:2711:27, 2 July 2024 diff hist +116 N Translations:Simplification Data/10/nl Created page with "* 8.67 MB * [https://huggingface.co/datasets/NetherlandsForensicInstitute/simplewiki-translated-nl Download dataset]" current
- 11:2711:27, 2 July 2024 diff hist +328 N Translations:Simplification Data/9/nl Created page with "Vertaalde dataset gecreëerd door het Nederlands Forensisch Instituut met Meta's [https://ai.meta.com/research/no-language-left-behind/ No Language Left Behind model]. Het bevat 167000 gealigneerde zinsparen en doet dienst als de Nederlandse vertaling van de SimpleWiki [https://cs.pomona.edu/~dkauchak/simplification/ dataset]." current
- 11:2511:25, 2 July 2024 diff hist +28 N Translations:Simplification Data/8/nl Created page with "===NFI SimpleWiki dataset===" current
- 11:2511:25, 2 July 2024 diff hist +791 N Translations:Simplification Data/7/nl Created page with "*[https://github.com/tsei902/simplify_dutch/tree/main/resources/datasets/wikilarge Github download] * <small>Seidl, T., Vandeghinste, V., & Van de Cruys, T. (2023). [https://kuleuven.limo.libis.be/discovery/fulldisplay?docid=alma9993527112601488&context=L&vid=32KUL_KUL:KULeuven&lang=en&search_scope=All_Content&adaptor=Local%20Search%20Engine&tab=all_content_tab&query=any,contains,seidl%20theresa&offset=0 Controllable Sentence Simplification in Dutch]. KU Leuven. Facultei..."
- 11:2511:25, 2 July 2024 diff hist +166 N Translations:Simplification Data/6/nl Created page with "Automatische vertaling van de Wikilarge dataset, nuttig voor automatische vereenvoudiging (Seidl et al., 2023). Vrij beschikbaar. Originele dataset van Zhang & Lapata" current
- 11:2411:24, 2 July 2024 diff hist +23 N Translations:Simplification Data/5/nl Created page with "===Wikilarge Dataset===" current
- 11:2311:23, 2 July 2024 diff hist +761 N Translations:Simplification Data/4/nl Created page with "*[https://github.com/tsei902/simplify_dutch/tree/main/resources/datasets/asset Github download] * <small>Alva-Manchego, F., Martin, L., Bordes, A., Scarton, C., Sagot, B., & Specia, L. (2020). ASSET: A dataset for tuning and evaluation of sentence simplification models with multiple rewriting transformations. arXiv preprint arXiv:2005.00481.</small> * <small>Seidl, T., Vandeghinste, V., & Van de Cruys, T. (2023). [https://kuleuven.limo.libis.be/discovery/fulldisplay?doci..."
- 11:2311:23, 2 July 2024 diff hist +147 N Translations:Simplification Data/3/nl Created page with "Het ASSET simplificatiecorpus (Alva-Manchego et al, 2020) is automatisch vertaald naar het Nederlands (Seidl et al., 2023), en is vrij beschikbaar."
- 11:2311:23, 2 July 2024 diff hist +7,016 N Simplification Data/nl Created page with "Simplificatiedata"
- 11:2211:22, 2 July 2024 diff hist +31 N Translations:Simplification Data/2/nl Created page with "===ASSET Simplificatiecorpus===" current
- 11:2211:22, 2 July 2024 diff hist +34 N Translations:Simplification Data/1/nl Created page with "==Automatisch vertaalde datasets==" current
- 11:2211:22, 2 July 2024 diff hist +17 N Translations:Simplification Data/Page display title/nl Created page with "Simplificatiedata" current
- 11:2211:22, 2 July 2024 diff hist +327 Simplification Data Marked this version for translation
- 11:2211:22, 2 July 2024 diff hist +1 Simplification Data No edit summary
- 07:5107:51, 2 July 2024 diff hist +455 Simplification Data →Comparable Corpus Wablieft De Standaard Tag: Visual edit: Switched
26 June 2024
- 14:4714:47, 26 June 2024 diff hist +232 Dictionaries No edit summary
24 June 2024
- 12:4812:48, 24 June 2024 diff hist −1,473 Language modeling/nl Replaced content with "==N-gram-modellering=="
- 12:4712:47, 24 June 2024 diff hist −445 Translations:Language modeling/1/nl Replaced content with "==N-gram-modellering==" current Tag: Replaced
- 12:4712:47, 24 June 2024 diff hist +61 N Translations:Language modeling/19/nl Created page with "*[https://scandeval.com/dutch-nlg/ Scandeval Nederlandse NLG]" current
- 12:4712:47, 24 June 2024 diff hist +32 N Translations:Language modeling/18/nl Created page with "===Scandeval Nederlandse NLG ===" current
- 12:4612:46, 24 June 2024 diff hist +73 N Translations:Language modeling/17/nl Created page with "Dit is een scorebord voor Nederlandse benchmarks voor grote taalmodellen." current
- 12:4612:46, 24 June 2024 diff hist −74 Translations:Language modeling/9/nl No edit summary current
- 12:4612:46, 24 June 2024 diff hist +497 N Translations:Language modeling/16/nl Created page with "DUMB is een benchmark voor het evalueren van de kwaliteit van taalmodellen voor Nederlandse natuurlijketaalverwerkingstaken. De set met taken is ontworpen om divers en uitdagend te zijn en de limieten van de bestaande taalmodellen te testen. De specifieke datasets en formaten zijn met name geschikt voor het finetunen van encodermodellen en toepasbaarheid voor grote generatieve modellen moet nog worden vastgesteld. Meer details zijn te lezen in het paper dat via onderstaa..." current
- 12:4612:46, 24 June 2024 diff hist +10 N Translations:Language modeling/15/nl Created page with "===DUMB===" current
- 12:4612:46, 24 June 2024 diff hist −509 Translations:Language modeling/7/nl Replaced content with "== Taalmodelleringsbenchmarks ==" current Tag: Replaced
- 12:4512:45, 24 June 2024 diff hist +83 N Translations:Language modeling/14/nl Created page with "spaCy is een gratis opensourcebibliotheek voor natuurlijketaalverwerking in Python." current
- 12:4512:45, 24 June 2024 diff hist −84 Translations:Language modeling/5/nl Replaced content with "==SpaCy==" current Tag: Replaced
- 12:4512:45, 24 June 2024 diff hist −330 Language modeling/nl Created page with "'Colibri core' is een natuurlijketaalverwerkingstool alsook een C++- en Python-bibliotheek voor het werken met standaard taalkundige constructies zoals n-grams en skipgrams (d.w.z. patronen met een of meerdere gaten van ofwel vaststaande, ofwel dynamische grootte) op een snelle en geheugenefficiënte manier. In de kern bevindt zich de colibri-patroonmodelleerder die het mogelijk maakt om querypatternmodellen te bouwen, bekijken en bewerken."
- 12:4512:45, 24 June 2024 diff hist +96 N Translations:Language modeling/13/nl Created page with "* [https://openai.com/ GPT-3] * [https://huggingface.co/docs/transformers/model_doc/mbart MBart]"
- 12:4512:45, 24 June 2024 diff hist −97 Translations:Language modeling/4/nl No edit summary current
- 12:4512:45, 24 June 2024 diff hist +387 N Translations:Language modeling/12/nl Created page with "* [https://huggingface.co/models?search=dutch Hugging Face Dutch Models] * [https://people.cs.kuleuven.be/~pieter.delobelle/robbert/ RobBERT]: Een Nederlands RoBERTa-taalmodel * [https://github.com/wietsedv/bertje BERTje]: Een Nederlands BERT-model * [https://github.com/Rijgersberg/GEITje GEITje]: Een groot open taalmodel * [https://huggingface.co/Tweeties/tweety-7b-dutch-v24a Tweety]" current
- 12:4512:45, 24 June 2024 diff hist −324 Translations:Language modeling/3/nl Replaced content with "==Grote Taalmodellen==" current Tag: Replaced
- 12:4412:44, 24 June 2024 diff hist +444 N Translations:Language modeling/11/nl Created page with "'Colibri core' is een natuurlijketaalverwerkingstool alsook een C++- en Python-bibliotheek voor het werken met standaard taalkundige constructies zoals n-grams en skipgrams (d.w.z. patronen met een of meerdere gaten van ofwel vaststaande, ofwel dynamische grootte) op een snelle en geheugenefficiënte manier. In de kern bevindt zich de colibri-patroonmodelleerder die het mogelijk maakt om querypatternmodellen te bouwen, bekijken en bewerken." current
- 12:4412:44, 24 June 2024 diff hist +24 Language modeling Marked this version for translation
17 June 2024
- 10:1110:11, 17 June 2024 diff hist −4,072 Parallel Monolingual Corpora info moved to simplification page current
- 10:1010:10, 17 June 2024 diff hist +2 Simplification Data No edit summary
- 10:1010:10, 17 June 2024 diff hist +99 Simplification Data No edit summary
- 10:0610:06, 17 June 2024 diff hist +466 Simplification Data →ChatGPT generated dataset by Van de Velde
- 10:0210:02, 17 June 2024 diff hist +829 Simplification Data No edit summary
- 09:5609:56, 17 June 2024 diff hist +351 Simplification Data →Dutch municipal data
- 09:5509:55, 17 June 2024 diff hist −1 Simplification Data →=Dutch municipal data
- 09:5409:54, 17 June 2024 diff hist +256 Simplification Data No edit summary
- 09:5209:52, 17 June 2024 diff hist +58 N Talk:Parallel Monolingual Corpora/en Created page with "Check for double information with Simplification data site" current
13 June 2024
- 14:5214:52, 13 June 2024 diff hist −3,227 K-Dutch/nl Created page with "<!-- ===Informatie-extractie!--> <!--* Het verwerken van historische varianten van het Nederlands!--> <!--* Tekst-mining!-->"
- 14:3914:39, 13 June 2024 diff hist +10 Translations:K-Dutch/156/nl No edit summary current
- 14:3914:39, 13 June 2024 diff hist +14 Translations:K-Dutch/155/nl No edit summary current