Personal information

Czech Republic

Biography

I am a researcher in Natural Language Processing, working at Charles University in Prague.

Activities

Employment (1)

Charles University in Prague: Prague, CZ

2003 to present | Senior Researcher (Institute of Formal and Applied Linguistics)
Employment
Source: Self-asserted source
Pavel Straňák

Education and qualifications (2)

Charles University in Prague: Prague, CZ

2011 | Ph.D. (Institute of Formal and Applied Linguistics)
Education
Source: Self-asserted source
Pavel Straňák

University of Ostrava: Ostrava, CZ

2001 | Mgr. (M.A.)
Education
Source: Self-asserted source
Pavel Straňák

Works (50 of 60)

LINDAT/CLARIAH-CZ: Where We Are and Where We Go

CLARIN: The Infrastructure for Language Resources
2022 | Book chapter
Part of ISBN: 978-3-11-076734-6
Contributors: Jan Hajič; Eva Hajičová; Barbora Hladká; Ondřej Košarko; Jozef Mišutka; Pavel Straňák
Source: Self-asserted source
Pavel Straňák

ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2021 | Book
EID: 2-s2.0-85115263527
Part of ISSN: 1611-3349 0302-9743
Contributors: Kopp, M.; Stankov, V.; Krůza, J.O.; Straňák, P.; Bojar, O.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

CLARIN-DSPACE repository at LINDAT/CLARIN

Grey Journal
2020 | Journal article
EID: 2-s2.0-85079072996
Part of ISSN: 1574-180X 1574-1796
Contributors: Straňák, P.; Košarko, O.; Mišutka, J.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Processing personal data without the consent of the data subject for the development and use of language resources

Linköping Electronic Conference Proceedings
2020 | Other
Part of ISSN: 1650-3740
Source: Self-asserted source
Pavel Straňák

The Impact of Copyright and Personal Data Laws on the Creation and Use of Language Models

Linköping Electronic Conference Proceedings
2020 | Journal article
Part of ISSN: 1650-3740
Source: Self-asserted source
Pavel Straňák

Bridging the LAPPS Grid and CLARIN

LREC 2018 - 11th International Conference on Language Resources and Evaluation
2019 | Conference paper
EID: 2-s2.0-85059916170
Contributors: Hinrichs, E.; Ide, N.; Pustejovsky, J.; Hajič, J.; Hinrichs, M.; Elahi, M.F.; Suderman, K.; Verhagen, M.; Rim, K.; Straňák, P. et al.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

CLARIN-DSpace repository at LINDAT/CLARIN : LINDAT/CLARIN FAIR repository for language data

the grey Journal – International Journal on Grey Literature
2019 | Journal article
Part of ISSN: 1574-1796
Source: Self-asserted source
Pavel Straňák

Diacritics restoration using neural networks

LREC 2018 - 11th International Conference on Language Resources and Evaluation
2019 | Conference paper
EID: 2-s2.0-85059902129
Contributors: Náplava, J.; Straka, M.; Straňák, P.; Hajič, J.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Bridging the LAPPS Grid and CLARIN

Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018)
2018 | Conference paper
Part of ISBN: 979-10-95546-00-9
Source: Self-asserted source
Pavel Straňák

Diacritics Restoration Using Neural Networks

Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018)
2018 | Conference paper
Part of ISBN: 979-10-95546-00-9
Source: Self-asserted source
Pavel Straňák

Implementation of an Open Science Policy in the context of management of CLARIN language resources: a need for changes?

Linköping Electronic Conference Proceedings
2018 | Journal article
Part of ISSN: 1650-3740
Source: Self-asserted source
Pavel Straňák

Prague Dependency Treebank 3.5

2018 | Other
Source: Self-asserted source
Pavel Straňák

Extracting verbal multiword data from rich treebank annotation

CEUR Workshop Proceedings
2017 | Conference paper
EID: 2-s2.0-85013243622
Part of ISSN: 1613-0073
Contributors: Bejček, E.; Hajič, J.; Straňák, P.; Urešová, Z.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Extracting Verbal Multiword Data from Rich Treebank Annotation

Proceedings of the 15th International Workshop on Treebanks and Linguistic Theories (TLT 15)
2017 | Conference paper
Source: Self-asserted source
Pavel Straňák

Improving corpus search via parsing

Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
2016 | Conference paper
EID: 2-s2.0-85037091113
Contributors: Klyueva, N.; Straňák, P.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Improving Corpus Search via Parsing

Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
2016 | Conference paper
Part of ISBN: 978-2-9517408-9-1
Source: Self-asserted source
Pavel Straňák

Linguistic digital repository based on DSpace

2016 | Other
Source: Self-asserted source
Pavel Straňák

Linguistic digital repository based on DSpace 5.2

2016 | Other
Source: Self-asserted source
Pavel Straňák

Public License Selector

2016 | Other
Source: Self-asserted source
Pavel Straňák

The public license selector: Making open licensing easier

Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
2016 | Conference paper
EID: 2-s2.0-85037153513
Contributors: Kamocki, P.; Straňák, P.; Sedlák, M.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

The Public License Selector: Making Open Licensing Easier

Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
2016 | Conference paper
Part of ISBN: 978-2-9517408-9-1
Source: Self-asserted source
Pavel Straňák

« Trust me. I’m a License Selector ». Licensing for Digital Humanities

2016 | Other
Source: Self-asserted source
Pavel Straňák

B2SHARE: An open eScience data sharing platform

Proceedings - 11th IEEE International Conference on eScience, eScience 2015
2015 | Conference paper
EID: 2-s2.0-84959053352
Contributors: Ardestani, S.B.; Hakansson, C.J.; Laure, E.; Livenson, I.; Stranak, P.; Dima, E.; Blommesteijn, D.; Van De Sanden, M.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

B2SHARE: An Open eScience Data Sharing Platform

2015 IEEE 11th International Conference on e-Science (e-Science)
2015 | Conference paper
Part of ISBN: 978-1-4673-9325-6
Source: Self-asserted source
Pavel Straňák

Improvements to Korektor: A case study with native and non-native Czech

CEUR Workshop Proceedings
2015 | Conference paper
EID: 2-s2.0-84944312382
Part of ISSN: 1613-0073
Contributors: Ramasamy, L.; Rosen, A.; Straňák, P.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Improvements to Korektor: A case study with native and non-native Czech

Proceedings of the 15th conference ITAT 2015: Slovenskočeský NLP workshop (SloNLP 2015)
2015 | Conference paper
Part of ISBN: 978-1515120650
Source: Self-asserted source
Pavel Straňák

HindEnCorp - Hindi-English and Hindi-only corpus for machine translation

Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
2014 | Conference paper
EID: 2-s2.0-85018263318
Contributors: Bojar, O.; Diatka, V.; Rychlý, P.; Straňák, P.; Suchomel, V.; Tamchyna, A.; Zeman, D.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

HindEnCorp – Hindi-English and Hindi-only Corpus for Machine Translation

Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014)
2014 | Conference paper
Part of ISBN: 978-2-9517408-8-4
Source: Self-asserted source
Pavel Straňák

From PDT 2.0 to PDT 3.0 (Modifications and Complements)

2013 | Report
Source: Self-asserted source
Pavel Straňák

Syntactic Identification of Occurrences of Multiword Expressions in Text using a Lexicon with Dependency Structures

Proceedings of the 9th Workshop on Multiword Expressions, MWE 2013 - in conjunction with the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013
2013 | Conference paper
EID: 2-s2.0-85043626700
Contributors: Bejček, E.; Straňák, P.; Pecina, P.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Syntactic Identification of Occurrences of Multiword Expressions in Text using a Lexicon with Dependency Structures

The 9th Workshop on Multiword Expressions (MWE 2013)
2013 | Conference paper
Part of ISBN: 978-1-937284-47-3
Source: Self-asserted source
Pavel Straňák

Úpravy a doplňky Pražského závislostního korpusu (Od PDT 2.0 k PDT 3.0)

2013 | Report
Source: Self-asserted source
Pavel Straňák

Korektor – A System for Contextual Spell-checking and Diacritics Completion

Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012)
2012 | Conference paper
Source: Self-asserted source
Pavel Straňák

Prague Dependency Treebank 2.5 - A revisited version of PDT 2.0

24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers
2012 | Conference paper
EID: 2-s2.0-84876797393
Contributors: Bejček, E.; Panevová, J.; Popelka, J.; Straňák, P.; Ševčíková, M.; Štěpánek, J.; Žabokrtský, Z.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Prague Dependency Treebank 2.5 – a revisited version of PDT 2.0

Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012)
2012 | Conference paper
Source: Self-asserted source
Pavel Straňák

Influence of Treebank Design on Representation of Multiword Expressions

Lecture Notes in Computer Science
2011 | Journal article
Source: Self-asserted source
Pavel Straňák

Prague Dependency Treebank 2.5

2011 | Other
Source: Self-asserted source
Pavel Straňák

Data issues in English-to-Hindi machine translation

Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010
2010 | Conference paper
EID: 2-s2.0-84905693313
Contributors: Bojar, O.; Straňák, P.; Zeman, D.
Source: Self-asserted source
Pavel Straňák via Scopus - Elsevier

Data Issues in English-to-Hindi Machine Translation

Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010)
2010 | Conference paper
Part of ISBN: 2-9517408-6-7
Source: Self-asserted source
Pavel Straňák

Lexikálně-sémantická anotace PDT pomocí Českého WordNetu

2010 | Other
Source: Self-asserted source
Pavel Straňák

Multiword Expressions in PDT 2.0

2010 | Other
Source: Self-asserted source
Pavel Straňák

Representing Layered and Structured Data in the CoNLL-ST Format

Proceedings of the Second International Conference on Global Interoperability for Language Resources
2010 | Conference paper
Part of ISBN: 978-962-442-323-5
Source: Self-asserted source
Pavel Straňák

UMC002: English-Hindi Parallel Corpus

2010 | Other
Source: Self-asserted source
Pavel Straňák

UMC004: Hindi Web Texts

2010 | Other
Source: Self-asserted source
Pavel Straňák

Český WordNet 1.9 PDT

2010 | Other
Source: Self-asserted source
Pavel Straňák

CoNLL 2009 Shared Task Czech Development Set

2009 | Other
Source: Self-asserted source
Pavel Straňák

CoNLL 2009 Shared Task Czech Training Set

2009 | Other
Source: Self-asserted source
Pavel Straňák

CoNLL 2009 Shared Task Czech Trial Set

2009 | Other
Source: Self-asserted source
Pavel Straňák

English-Hindi Translation – Obtaining Mediocre Results with Bad Data and Fancy Models

Proceedings of ICON 2009: 7th International Conference on Natural Language Processing
2009 | Conference paper
Part of ISBN: 978-023-032-845-7
Source: Self-asserted source
Pavel Straňák