Vyhledávání publikací

Vybrané publikace/výsledky

Grammar Development for Czech Syntactic Parser with Corpus-based Techniques (2006)
Dependency and Phrasal Parsers of the Czech Language: A Comparison (2007)
Reducing the Number of Resulting Parsing Trees for the Czech Language Using the Beautified Chart Method (2007)
Corpus Query System Bonito - Recent Development (2007)
Displaying Bidirectional Text Concordances in KWIC format (2007)
Power Networks Dialogues - Automatic Analysis and Evaluation of a Domain-Specific Text Corpus (2007)
New Methods for Pruning and Ordering of Syntax Parsing Trees (2008)
Test Suite for the Czech Parser Synt (2008)
Power Networks Dialogs - Enhancing Domain-Specific Text Processing Techniques and Resources (2008)
Syntaktická analýza s využitím postupné segmentace věty (2008)
Czech Vulgarisms in Text Corpora (2009)
Mining Phrases from Syntactic Analysis (2009)
Syntactic Analysis as Pattern Matching: The SET Parsing System (2009)
Measuring Coverage of a Valency Lexicon using Full Syntactic Analysis (2009)
Prague Dependency Treebank Annotation Errors: A Preliminary Analysis (2009)
Associating Collocations with Dictionary Senses (2009)
Punctuation Detection with Full Syntactic Parsing (2010)
A Quantitative Evaluation of Word Sketches (2010)
Fast syntactic searching in very large corpora for many languages (2010)
Through Low-Cost Annotation to Reliable Parsing Evaluation (2010)
How to Analyze Natural Language with Transparent Intensional Logic? (2010)
CzechParl: Corpus of Stenographic Protocols from Czech Parliament (2010)
Syntactic Analysis Using Finite Patterns: A New Parsing System for Czech (2011)
Tickbox Lexicography (2010)
wwwsynt - webové rozhraní analyzátoru synt (2010)
OOCorr (2010)
E-learning punctuation test based on the error corpus (2010)
Effective Parsing Using Competing CFG Rules (2011)
Japanese Word Sketches: Advances and Problems (2011)
Information Extraction for Czech Based on Syntactic Analysis (2011)
Analyzing Time-Related Clauses in Transparent Intensional Logic (2011)
Czech Morphological Tagset Revisited (2011)
Finding Multiwords of More Than Two Words (2012)
Building a 70 billion word corpus of English from ClueWeb (2012)
Setting up for corpus lexicography (2012)
Rule-Based Parsing of Morphologically Rich Languages (2012)
Authorship Verification based on Syntax Features (2012)
Adaptation of Czech Parsers for Slovak (2012)
Linguistic Logical Analysis of Direct Speech (2012)
Enhancing Czech Parsing with Verb Valency Frames (2013)
Syntactic parser SET (2012)
The TenTen Corpus Family (2013)
User-friendly interface of error/correction-annotated corpus for both teachers and researchers (2013)
Fast Construction of a Word-Number Index for Large Data (2013)
Portable Lexical Analysis for Parsing of Morphologically-Rich Languages (2013)
Towards taggers and parsers for Slovak (2013)
Methods for Detection of Word Usage over Time (2013)
PtTenTen: A corpus for Portuguese lexicography (2014)
Finding Terms in Corpora for Many Languages with the Sketch Engine (2014)
Extrinsic Corpus Evaluation with a Collocation Dictionary Task (2014)
Effective Corpus Virtualization (2014)
The Sketch Engine: ten years on (2014)
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing (2014)
arTenTen: Arabic Corpus and Word Sketches (2014)
Information Extraction for Czech Based on Syntactic Analysis (2014)
Character-based Language Model (2014)
Improving Coverage of Translation Memories with Language Modelling (2014)
Optimization of Regular Expression Evaluation within the Manatee Corpus Management System (2014)
Low Inter-Annotator Agreement = An Ill-Defined Problem? (2014)
SkELL: Web Interface for English Language Learning (2014)
Text Tokenisation Using unitok (2014)
Intelligent Search and Replace for Czech Phrases (2014)
Semiautomatic Building and Extension of Terminological Thesaurus for Land Surveying Domain (2014)
Mapping Czech and English Valency Lexicons: Preliminary Report (2014)
Partial Grammar Checking for Czech Using the SET Parser (2014)
Interactive Visualizations of Corpus Data in Sketch Engine (2015)
Longest-commonest Match (2015)
SemEval-2015 Task 15: A CPA dictionary-entry-building task (2015)
Electronic lexicography in the 21st century: linking lexical data in the digital age. Proceedings of the eLex 2015 conference (2015)
Increasing Coverage of Translation Memories with Linguistically Motivated Segment Combination Methods (2015)
Turkic Language Support in Sketch Engine (2015)
Corpus Based Extraction of Hypernyms in Terminological Thesaurus for Land Surveying Domain (2015)
Towards Automatic Finding of Word Sense Changes in Time (2015)
Bilingual Terminology Extraction in Sketch Engine (2015)
Software and Data for Corpus Pattern Analysis (2015)
AST: New Tool for Logical Analysis of Sentences based on Transparent Intensional Logic (2015)
Technologie pro využití obsahu tezauru pro publikační účely (2015)
Terminologický tezaurus pro obor zeměměřictví a katastru nemovitostí: Certifikovaná metodika (2015)
On Evaluation of Natural Language Processing Tasks: Is Gold Standard Evaluation Methodology a Good Solution? (2016)
Walking the tightrope between linguistics and language engineering (2017)
European Union Language Resources in Sketch Engine (2016)
Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study (2016)
VPS-GradeUp: Graded Decisions on Usage Patterns (2016)
Sketch Engine for Bilingual Lexicography (2016)
RuSkELL: Online Language Learning Tool for Russian Language (2016)
What Do Graded Decisions Tell Us about Verb Uses (2016)
Multilingual CPA: Linking Verb Patterns across Languages (2016)
Semantic word sketches (2015)
English-French Document Alignment Based on Keywords and Statistical Translation (2016)
Sketch Engine for English Language Learning (2015)
DIACRAN: a framework for diachronic analysis (2015)
Longest-commonest match (2015)
Annotated Amharic Corpora (2016)
AQA: Automatic Question Answering System for Czech (2016)
Evaluation and Improvements in Punctuation Detection for Czech (2016)
Finding Definitions in Large Corpora with Sketch Engine (2016)
Between Comparable and Parallel: English-Czech Corpus from Wikipedia (2016)
Czech Grammar Agreement Dataset for Evaluation of Language Models (2016)
Bilingual Logical Analysis of Natural Language Sentences (2016)
Options for Automatic Creation of Dictionary Definitions from Corpora (2016)
Evaluating Natural Language Processing Tasks with Low Inter-Annotator Agreement: The Case of Corpus Applications (2016)
Terminology Extraction for Academic Slovene Using Sketch Engine (2016)
Large Scale Keyword Extraction using a Finite State Backend (2016)
DSL Shared task 2016: Perfect Is The Enemy of Good Language Discrimination Through Expectation-Maximization and Chunk-based Language Model (2016)
HaBiT system (2017)
Set of Ethiopian Web Corpora (2016)
The advent of post-editing lexicography (2017)
Accelerating Corpus Search Using Multiple Cores (2017)
Overview and Future of Czech Wordnet (2017)
Lexicography and Natural Language Processing (2017)
Text Punctuation: An Inter-annotator Agreement Study (2017)
Sentence and Word Embedding Employed in Open Question-Answering (2018)
Automated Identification of Domain Preferences of Collocations (2017)
Manipulative Propaganda Techniques: Technical Report (2017)
Idiomatic Expressions in VerbaLex (2017)
New features in DEBVisDic for WordNet Visualization and User Feedback (2017)
Multilinguality Adaptations of Natural Language Logical Analyzer (2017)
Enlargement of the Czech Question-Answering Dataset to SQAD v2.0 (2017)
Noc vědců FF MU 2017 (2017)
Software Tools for Big Data Resources in Family Names Dictionaries (2018)
Recognition of OCR Invoice Metadata Block Types (2018)
Practical Post- Editing Lexicography with Lexonomy and Sketch Engine (2018)
Wordnet Consistency Checking via Crowdsourcing (2018)
Jazyková úroveň zpravodajských vstupů (2018)
OCRMiner (2018)
Removing spam from web corpora through supervised learning using FastText (2017)
Original or Copy Paste? An Analysis of Czech Disinformation Media Outlets Content (2018)
csTenTen17, a Recent Czech Web Corpus (2018)
Automatically Created Noun Definitions for Czech (2018)
Towards Czech Answer Type Analysis (2018)
Recurrent Networks in AQA Answer Selection (2018)
Sustainable long-term WordNet development and maintenance: Case study of the Czech WordNet (2018)
A New Approach for Semi-automatic Building and Extending a Multilingual Terminology Thesaurus (2019)
Question and Answer Classification in Czech Question Answering Benchmark Dataset (2019)
Multiple Instance Terminological Thesaurus with Central Management (2018)
Business Research Forum : prezentace projektu TL02000146 Webový pravopisný, gramatický a typografický korektor pro český jazyk (2019)
VerbaLex - Comprehensive Dictionary of Czech Verb Valencies (2019)
Word Sense Induction Using Word Sketches (2019)
Automating dictionary production: a Tagalog-English-Korean dictionary from scratch (2019)
Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts (2019)
Noc vědců 2019: Čeština na FF MU: Fauna a Flóra – Musíme Uchránit (2019)
SkELL Corpora as a Part of the Language Portal Sonaveeb: Problems and Perspectives (2019)
Počítačová lingvistika v Brně (2019)
Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2019 (2019)
Neural Tagger for Czech Language: Capturing Linguistic Phenomena in Web Corpora (2019)
Evaluation and Error Analysis of Rule-based Paraphrase Generation for Czech (2019)
Structured Information Extraction from Pharmaceutical Records (2019)
Automatically Created Noun Explanations for English (2019)
Czech Question Answering with Extended SQAD v3.0 Benchmark Dataset (2019)
Discriminating Between Similar Languages Using Large Web Corpora (2019)
A Distributional Multi-word Thesaurus in Sketch Engine (2019)
New Online Proofreader for Czech (2019)
Lingvistická analýza mluveného projevu pacientů s afázií (2019)
Konvence, kreativita a patologie v mluvené češtině (2019)
Semantics of Token Positions in Sentences (2019)
Karel and Karel (Pepper) (2019)
Corpus-based Automatic Identification of Multi-word Entry Candidates (2019)
Electronic lexicography in the 21st century. Proceedings of the eLex 2017 conference (2017)
Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference (2019)
Lexonomy Clinic (2019)
Webová aplikace pro analýzu sentimentu (2019)
Propaganda web crawler (2019)
Propaganda analysis and annotation (2019)
Propaganda detection tool (2020)
Improving RNN-based Answer Selection for Morphologically Rich Languages (2020)
Current Challenges in Web Corpus Building (2020)
Genre Annotation of Web Corpora: Scheme and Issues (2021)
Cthulhu Hails from Wales: N-gram Frequency Analysis of R'lyehian (2020)
Employing Sentence Context in Czech Answer Selection (2020)
Efficient Management and Optimization of Very Large Machine Learning Dataset for Question Answering (2020)
Removing Spam from Web Corpora Through Supervised Learning and Semi-manual Classification of Web Sites (2020)
Softwarové nástroje pro tvorbu odborných tezaurů a slovníků pro potřeby rozvoje národní infrastruktury pro prostorové informace (2021)
New developments in Lexonomy (2021)
Evaluating Long Contexts in the Czech Answer Selection Task (2021)
DMoG : A Data-Based Morphological Guesser (2021)
Website Properties in Relation to the Quality of Text Extracted for Web Corpora (2021)
Development of HAMOD: a High Agreement Multi-lingual Outlier Detection dataset (2021)
Comparing RNN and Transformer Context Representations in the Czech Answer Selection Task (2022)
Software Tezaurus: nástroje pro podporu systematické a koordinované tvorby odborných tezaurů (2021)
Software Tezaurus: nástroje pro centrální systém tvorby odborných tezaurů (2021)
Software Tezaurus: funkcionalita pro styk s externími systémy (2021)
Opravidlo (2022)
New Language Identification and Sentiment Analysis Modules for Social Media Communication (2022)
Are Dictionary Definitions of Verbs in Corpora? : Discovering Dead Ends in Generating Explanations of Verbs (2022)
Blooming Onion: Efficient Deduplication through Approximate Membership Testing (2022)
Manipulative Style Recognition of Czech News Texts using Stylometric Text Analysis (2022)
Semi-Manual Annotation of Topics and Genres in Web Corpora : The Cheap and Fast Way (2022)
Using NVH as a Backbone Format in the Lexonomy Dictionary Editor (2022)
Evaluation of Automatically Constructed Word Meaning Explanations (2022)
MoMaT WebAPI (2022)
Evaluation of the Cross-lingual Embedding Models from the Lexicographic Perspective (2023)
Rapid Ukrainian-English Dictionary Creation Using Post-Edited Corpus Data (2023)
Development of Evidence-Based Grammars for Terminology Extraction in OneClick Terms (2023)
Corpus-based Bilingual Terminology Extraction using One-Click Terms (2023)
Web corpora for under-resourced languages (2023)
Towards Perfection of Machine Learning of Competing Patterns: The Use Case of Czechoslovak Patterns Development (2023)
A Roadmap for Universal Syllabic Segmentation (2023)
Does Size Matter? - Comparing Evaluation Dataset Size for the Bilingual Lexicon Induction (2023)
Five Years of Language Services (2023)
Reproducibility and Robustness of Authorship Identification Approaches (2023)
Overview of Latin American and Iberian corpora in Sketch Engine (2024)
Recognition of Propaganda Techniques in Newspaper Texts: Fusion of Content and Style Analysis (2024)
Identifying Metadata-Specific Collocations in Text Corpora (2021)
Semi-automatic building of large-scale digital dictionaries (2021)
ShadowSense: A Multi-annotated Dataset for Evaluating Word Sense Induction (2024)
Fine-Grained Language Relatedness for Zero-Shot Silesian-English Translation (2023)
On Rapid Annotation of Czech Headwords: Analysing the First Tasks of Czech Dictionary Express (2024)
Dictionary Express: First Phases Rapid dictionary-making method for European, Asian and other languages (2024)
Development of the NVH Schema Format for Lexicographic Purposes (2023)
DANTE Resurrected Large Open Source Lexical Data Database for English (2024)
The end of lexicography? Can ChatGPT outperform current tools for post-editing lexicography? (2023)
A New Czech Pipeline in Sketch Engine (2024)
SlamaTrain– Representative Training Dataset for Slavonic Large Language Models (2024)

Odebrat vše z výběru

Operace s vybranými výsledky

Export a tisk
Seznam vybraných záznamů

Vyhledávání publikací

Vybrané publikace/výsledky

Operace s vybranými výsledky

Další aplikace