Badges
Certifications
Work Experience
Assistant Research Fellow
Hungarian Research Centre for Linguistics• September 2021 - December 2021
Examined semantic networks and the grammaticalization of posture verbs in various languages. Tasks: • data collection (from various printed and digital sources) and labelling data in Excel, • performing descriptive data analysis with Python (Pandas), documenting the research with Jupyter Notebook, and visualising the data using Excel and Matplotlib, • publishing the results in academic journals.
Co-Founder
Belfry.io• April 2015 - July 2018
Belfry.io was a multiplatform tool that was able to identify hate speech automatically with 97% accuracy in user-generated comments (using NLP tools and AI), in order to help moderate comment streams. • Conducted customer interviews to validate the product idea and to better understand customer needs. • Identified data sets with high accuracy for training machine learning models, and set a plan for collecting the data. • Created gold-standard data sets for evaluating machine learning models (mainly SVM and random forest). • Conducted SQL (MYSQL) queries in BigQuery database to create ontologies and controlled vocabularies. • Adapted NLP tools (e.g., HunSpell morphological analyser library and command-line tool) to online communication for feature extraction. • Supervised a team of five colleagues: worked closely with developers, machine learning engineers, and a UX/UI expert. • Wrote manuals for end users of Belfry.io, and trained them to use the software. • Designed a hallway usability test to uncover user experience problems, and collected other high-level feedback from users. • Gave investor pitch decks and meetup talks using PowerPoint and Google Slides. • Did sales and account management.
Assistant Research Fellow
Eötvös Loránd University• September 2013 - August 2017
The goal of the Multilingual practices in Finno-Ugric communities project was to create databases of three endangered Uralic languages that are difficult to research in the absence of audio material. • Participated in the writing the research proposal, the financial planning and administration of the publicly funded project (total budget: 115,000 Euro). • Designed the structure of the databases: selected the IT tools used (Praat, ELAN) and elaborated the annotation principles. • Trained four annotators, wrote manuals, and supervised the database construction process. • Collected data using both structured and unstructured interviews, and questionnaire surveys as well. • Built a speech corpus of 7000 tokens (transcribing, translating, and annotating various peculiarities) in ELAN. • Performed data cleaning and descriptive data analysis using Python (Pimpy, NumPy, Pandas), and visualised the trends using Excel. • Gave presentations at national and international scientific conferences, popular science events and university curses (using PowerPoint) and published the results in scientific periodicals.
Education
Eötvös Lorand University
Linguistics, PhD• September 2011 - January 2022
Examined how information is delivered in spontaneous speech through intonation units in Mansi. • Conducted literature review and formulated research questions based on scientific needs. • Collected data using structured interviews and questionnaire surveys, and built / expanded the first database of Mansi speech. • Performed descriptive data analysis: searched for patterns and trends in complex, multivariable data sets and visualized the data with Excel and Python (Pandas, NumPy, Matplotlib). • Documented the research with Jupyter Notebook, and published the findings in academic journals, gave presentations at scientific conferences and popular science events (using PowerPoint). • Led BA and MA courses.
Eötvös Lorand University
Applied Linguistics, MA• September 2007 - July 2011
Eötvös Lorand University
Philology in Finno-Ugrian studies, MA• September 2005 - July 2010
Links
Skills
katkasi has not updated skills details yet.