User contributions for Vincent
5 July 2024
- 14:20, 5 July 2024 −185 Translations:Q&A/4/nl No edit summary current
- 14:20, 5 July 2024 +311 N Translations:Q&A/68/nl Created page with "Op de [https://kdutch.ivdnt.org/wiki/K-Dutch#Corpora hoofdpagina] vindt u een lijst met verschillende soorten corpora die wij hebben. Domeinspecifieke corpora zijn de Parlementaire corpora en de Corpora van academische teksten. Onder de Parallele corpora bevinden zich ook domeinspecifieke corpora."
- 14:19, 5 July 2024 −312 Translations:Q&A/3/nl No edit summary current
- 14:19, 5 July 2024 +11,999 Q&A/nl Created page with "Het is mogelijk om een account aan te vragen via de [https://idm.clarin.eu/unitygw/pub#!registration-CLARIN%20Identity%20Registration CLARIN-accountregistratiepagina]."
- 14:19, 5 July 2024 +167 N Translations:Q&A/67/nl Created page with "Het is mogelijk om een account aan te vragen via de [https://idm.clarin.eu/unitygw/pub#!registration-CLARIN%20Identity%20Registration CLARIN-accountregistratiepagina]." current
- 14:19, 5 July 2024 −168 Translations:Q&A/2/nl No edit summary current
- 14:19, 5 July 2024 +336 Q&A Marked this version for translation
- 14:18, 5 July 2024 +28 Q&A No edit summary
- 14:16, 5 July 2024 0 m Dictionaries →4-Language Finance, Economy & Business Terminology (NE-EN-FR-DE) current Tag: Visual edit
- 14:16, 5 July 2024 −40 Dictionaries/nl No edit summary current Tag: Manual revert
2 July 2024
- 13:40, 2 July 2024 −251 Simplification Data/nl Created page with "Gecreëerd in het kader van de masterthesis van Charlotte Van de Velde. De dataset bevat Nederlandse bronzinnen gealigneerd met vereenvoudigde zinnen, beide gegenereerd door ChatGPT. Alles gecombineerd bestaat de dataset uit 1267 ingangen."
- 13:39, 2 July 2024 +90 N Translations:Simplification Data/28/nl Created page with "*265 KB *[https://github.com/Amsterdam-AI-Team/dutch-municipal-text-simplification Github]" current
- 13:38, 2 July 2024 +480 N Translations:Simplification Data/27/nl Created page with "Het Nederlandse gemeentelijke corpus is een parallel monolinguaal corpus voor de evaluatie van zinsvereenvoudiging in het Nederlandse gemeentelijke domein. Het corpus is gemaakt door Amsterdam Intelligence. Het bevat 1.311 vertaalde parallelle zinsparen die automatisch gealigneerd werden. De zinsparen zijn afkomstig uit 50 documenten van de communicatieafdeling van de gemeente Amsterdam die handmatig werden vereenvoudigd om de vereenvoudiging voor het Nederlands te evalu..." current
- 13:38, 2 July 2024 +36 N Translations:Simplification Data/26/nl Created page with "===Nederlandse gemeentelijke data===" current
- 13:38, 2 July 2024 +25 N Translations:Simplification Data/25/nl Created page with "==Manueel vereenvoudigd==" current
- 13:38, 2 July 2024 +106 N Translations:Simplification Data/24/nl Created page with "* [https://huggingface.co/datasets/BramVanroy/chatgpt-dutch-simplification Downloadpagina (CSV-bestanden)]" current
- 13:38, 2 July 2024 +96 N Translations:Simplification Data/23/nl Created page with "# Training = 1013 zinnen (262 KB) # Validatie = 126 zinnen (32.6 KB) # Test = 128 zinnen (33 KB)" current
- 13:37, 2 July 2024 +239 N Translations:Simplification Data/22/nl Created page with "Gecreëerd in het kader van de masterthesis van Charlotte Van de Velde. De dataset bevat Nederlandse bronzinnen gealigneerd met vereenvoudigde zinnen, beide gegenereerd door ChatGPT. Alles gecombineerd bestaat de dataset uit 1267 ingangen." current
- 13:36, 2 July 2024 +54 N Translations:Simplification Data/21/nl Created page with "===Door ChatGPT gegenereerde dataset (Van de Velde)===" current
- 11:33, 2 July 2024 −804 Simplification Data/nl Created page with "Het ASSET simplificatiecorpus (Alva-Manchego et al, 2020) is automatisch vertaald naar het Nederlands (Seidl et al., 2023), en is vrij beschikbaar."
- 11:31, 2 July 2024 +86 N Translations:Simplification Data/20/nl Created page with "* 3.02 MB * [https://huggingface.co/datasets/UWV/veringewikkelderingen Downloadpagina]" current
- 11:31, 2 July 2024 +267 N Translations:Simplification Data/19/nl Created page with "Een uitgebreidere versie van deze dataset werd gemaakt door Michiel Buisman en Bram Vanroy. Deze dataset bevat een eerste, kleine set variaties van Wikipediaparagrafen in verschillende stijlen (jargon, officieel, archaïsche taal, technisch, academisch en poëtisch)." current
- 11:31, 2 July 2024 +93 N Translations:Simplification Data/18/nl Created page with "* [https://huggingface.co/datasets/UWV/Leesplank_NL_wikipedia_simplifications Downloadpagina]" current
- 11:31, 2 July 2024 +122 N Translations:Simplification Data/17/nl Created page with "* [https://huggingface.co/datasets/UWV/Leesplank_NL_wikipedia_simplifications/blob/main/README.md HuggingFace ReadMe file]" current
- 11:30, 2 July 2024 +371 N Translations:Simplification Data/16/nl Created page with "Data bevat 2,391,206 pragrafen van prompt/resultaat-combinatiess, waar het prompt een paragraaf uit de Nederlandse Wikipedia is en het resultaat een vereenvoudigde tekst is, die een of meer paragrafen kan bevatten. Deze dataset werd gecreëerd door UWV, als onderdeel van project "Leesplank", een inspanning om datasets te genereren die ethisch en wettelijk in orde zijn." current
- 11:28, 2 July 2024 +32 N Translations:Simplification Data/15/nl Created page with "===UWV Leesplank NL wikipedia===" current
- 11:28, 2 July 2024 +25 N Translations:Simplification Data/14/nl Created page with "==Synthetische datasets==" current
- 11:28, 2 July 2024 +491 N Translations:Simplification Data/13/nl Created page with "*[https://github.com/nivack/comparable_corpus_Wablieft_deStandaard Github] *[https://kuleuven.limo.libis.be/discovery/fulldisplay?docid=alma9993153812401488&context=L&vid=32KUL_KUL:KULeuven&lang=en&search_scope=All_Content&adaptor=Local%20Search%20Engine&tab=all_content_tab&query=any,contains,nick%20vanackere&offset=0 Vanackere, N., & Vandeghinste, V. (2022). Building a comparable corpus between easy-to-read Dutch Wablieft and De Standaard. KU Leuven. Faculteit Ingenieur..." current
- 11:28, 2 July 2024 +477 N Translations:Simplification Data/12/nl Created page with "Corpus gecreëerd door Nick Vanackere. Het bevat 12.687 Wablieft-artikelen uit de periode 2012-2017 en 206.466 De Standaard-artikelen uit de periode 2013-2017. Om de vergelijkbaarheid te garanderen, werden alleen artikels van 08/01/2013 tot 16/11/2017 bekeken, wat resulteerde in 8.744 Wablieft-artikels en 202.284 De Standaard-artikels. Het verschil in het aantal artikelen is te wijten aan de verschijningsfrequentie: Wablieft verschijnt wekelijks en De Standaard dagelijks." current
- 11:27, 2 July 2024 +46 N Translations:Simplification Data/11/nl Created page with "==Vergelijkbaar Corpus Wablieft De Standaard==" current
- 11:27, 2 July 2024 +116 N Translations:Simplification Data/10/nl Created page with "* 8.67 MB * [https://huggingface.co/datasets/NetherlandsForensicInstitute/simplewiki-translated-nl Download dataset]" current
- 11:27, 2 July 2024 +328 N Translations:Simplification Data/9/nl Created page with "Vertaalde dataset gecreëerd door het Nederlands Forensisch Instituut met Meta's [https://ai.meta.com/research/no-language-left-behind/ No Language Left Behind model]. Het bevat 167000 gealigneerde zinsparen en doet dienst als de Nederlandse vertaling van de SimpleWiki [https://cs.pomona.edu/~dkauchak/simplification/ dataset]." current
- 11:25, 2 July 2024 +28 N Translations:Simplification Data/8/nl Created page with "===NFI SimpleWiki dataset===" current
- 11:25, 2 July 2024 +791 N Translations:Simplification Data/7/nl Created page with "*[https://github.com/tsei902/simplify_dutch/tree/main/resources/datasets/wikilarge Github download] * <small>Seidl, T., Vandeghinste, V., & Van de Cruys, T. (2023). [https://kuleuven.limo.libis.be/discovery/fulldisplay?docid=alma9993527112601488&context=L&vid=32KUL_KUL:KULeuven&lang=en&search_scope=All_Content&adaptor=Local%20Search%20Engine&tab=all_content_tab&query=any,contains,seidl%20theresa&offset=0 Controllable Sentence Simplification in Dutch]. KU Leuven. Facultei..."
- 11:25, 2 July 2024 +166 N Translations:Simplification Data/6/nl Created page with "Automatische vertaling van de Wikilarge dataset, nuttig voor automatische vereenvoudiging (Seidl et al., 2023). Vrij beschikbaar. Originele dataset van Zhang & Lapata" current
- 11:24, 2 July 2024 +23 N Translations:Simplification Data/5/nl Created page with "===Wikilarge Dataset===" current
- 11:23, 2 July 2024 +761 N Translations:Simplification Data/4/nl Created page with "*[https://github.com/tsei902/simplify_dutch/tree/main/resources/datasets/asset Github download] * <small>Alva-Manchego, F., Martin, L., Bordes, A., Scarton, C., Sagot, B., & Specia, L. (2020). ASSET: A dataset for tuning and evaluation of sentence simplification models with multiple rewriting transformations. arXiv preprint arXiv:2005.00481.</small> * <small>Seidl, T., Vandeghinste, V., & Van de Cruys, T. (2023). [https://kuleuven.limo.libis.be/discovery/fulldisplay?doci..."
- 11:23, 2 July 2024 +147 N Translations:Simplification Data/3/nl Created page with "Het ASSET simplificatiecorpus (Alva-Manchego et al, 2020) is automatisch vertaald naar het Nederlands (Seidl et al., 2023), en is vrij beschikbaar."
- 11:23, 2 July 2024 +7,016 N Simplification Data/nl Created page with "Simplificatiedata"
- 11:22, 2 July 2024 +31 N Translations:Simplification Data/2/nl Created page with "===ASSET Simplificatiecorpus===" current
- 11:22, 2 July 2024 +34 N Translations:Simplification Data/1/nl Created page with "==Automatisch vertaalde datasets==" current
- 11:22, 2 July 2024 +17 N Translations:Simplification Data/Page display title/nl Created page with "Simplificatiedata" current
- 11:22, 2 July 2024 +327 Simplification Data Marked this version for translation
- 11:22, 2 July 2024 +1 Simplification Data No edit summary
- 07:51, 2 July 2024 +455 Simplification Data →Comparable Corpus Wablieft De Standaard Tag: Visual edit: Switched
26 June 2024
- 14:47, 26 June 2024 +232 Dictionaries No edit summary
24 June 2024
- 12:48, 24 June 2024 −1,473 Language modeling/nl Replaced content with "==N-gram-modellering=="
- 12:47, 24 June 2024 −445 Translations:Language modeling/1/nl Replaced content with "==N-gram-modellering==" current Tag: Replaced
- 12:47, 24 June 2024 +61 N Translations:Language modeling/19/nl Created page with "*[https://scandeval.com/dutch-nlg/ Scandeval Nederlandse NLG]" current
- 12:47, 24 June 2024 +32 N Translations:Language modeling/18/nl Created page with "===Scandeval Nederlandse NLG ===" current