Skip to main content

Research Repository

Advanced Search

Output search results for "horvath" (36)

Cluster-based oversampling with area extraction from representative points for class imbalance learning (2024)
Journal Article
Farou, Z., Wang, Y., & Horváth, T. (2024). Cluster-based oversampling with area extraction from representative points for class imbalance learning. Intelligent Systems with Applications, 22, Article 200357. https://doi.org/10.1016/j.iswa.2024.200357

Class imbalance learning is challenging in various domains where training datasets exhibit disproportionate samples in a specific class. Resampling methods have been used to adjust the class distribution, but they often have limitations for small dis... Read More about Cluster-based oversampling with area extraction from representative points for class imbalance learning.

Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms (2024)
Journal Article
Mantovani, R. G., Horváth, T., Rossi, A. L. D., Cerri, R., Barbon Junior, S., Vanschoren, J., & de Carvalho, A. C. P. L. F. (2024). Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms. Data Mining and Knowledge Discovery, 38, 1364–1416. https://doi.org/10.1007/s10618-024-01002-5

Machine learning algorithms often contain many hyperparameters whose values affect the predictive performance of the induced models in intricate ways. Due to the high number of possibilities for these hyperparameter configurations and their complex i... Read More about Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms.

Factorization Techniques for Predicting Student Performance (2012)
Book Chapter
Thai-Nghe, N., Drumond, L., Horváth, T., Krohn-Grimberghe, A., Nanopoulos, A., & Schmidt-Thieme, L. (2012). Factorization Techniques for Predicting Student Performance. In O. C. Santos, & J. G. Boticario (Eds.), Educational Recommender Systems and Technologies: Practices and Challenges (129-153). IGI Global. https://doi.org/10.4018/978-1-61350-489-5.ch006

Recommender systems are widely used in many areas, especially in e-commerce. Recently, they are also applied in e-learning for recommending learning objects (e.g. papers) to students. This chapter introduces state-of-the-art recommender system techni... Read More about Factorization Techniques for Predicting Student Performance.

Tracing the Local Breeds in an Outdoor System – A Hungarian Example with Mangalica Pig Breed (2022)
Book Chapter
Alexy, M., & Horváth, T. Tracing the Local Breeds in an Outdoor System – A Hungarian Example with Mangalica Pig Breed. In Tracing the Domestic Pig. IntechOpen. https://doi.org/10.5772/intechopen.101615

Pig farming is largely characterized by closed, large-scale housing technology. These systems are driven by resource efficiency. In intensive technologies, humans control almost completely. However, there are pig farming systems where humans have jus... Read More about Tracing the Local Breeds in an Outdoor System – A Hungarian Example with Mangalica Pig Breed.

New Trends in Databases and Information Systems: ADBIS 2018 Short Papers and Workshops, AI*QA, BIGPMED, CSACDB, M2U, BigDataMAPS, ISTREND, DC, Budapest, Hungary, September, 2-5, 2018, Proceedings (2018)
Presentation / Conference Contribution
(2018, September). New Trends in Databases and Information Systems: ADBIS 2018 Short Papers and Workshops, AI*QA, BIGPMED, CSACDB, M2U, BigDataMAPS, ISTREND, DC, Budapest, Hungary, September, 2-5, 2018, Proceedings. Presented at 22th European Conference on Advances in Databases and Information Systems, ADBIS 2018, Budapest, Hungary

NCC: Neural concept compression for multilingual document recommendation (2023)
Presentation / Conference Contribution
Tashu, T. M., Lenz, M., & Horváth, T. NCC: Neural concept compression for multilingual document recommendation

In this work, we propose a novel method for generating inter-lingual document representations using neural network concept compression. The presented approach is intended to improve the quality of content-based multilingual document recommendation an... Read More about NCC: Neural concept compression for multilingual document recommendation.

Hyper-parameter initialization of classification algorithms using dynamic time warping: A perspective on PCA meta-features (2022)
Presentation / Conference Contribution
Horváth, T., Mantovani, R. G., & de Carvalho, A. C. Hyper-parameter initialization of classification algorithms using dynamic time warping: A perspective on PCA meta-features

Meta-learning, a concept from the area of automated machine learning, aims at providing decision support for data scientists by recommending a suitable setting (a machine learning algorithm or its hyper-parameters) to be used for a given dataset. Suc... Read More about Hyper-parameter initialization of classification algorithms using dynamic time warping: A perspective on PCA meta-features.

Dynamic noise filtering for multi-class classification of beehive audio data (2022)
Journal Article
Várkonyi, D. T., Seixas Junior, J. L., & Horváth, T. (2023). Dynamic noise filtering for multi-class classification of beehive audio data. Expert Systems with Applications, 213(Part A), Article 118850. https://doi.org/10.1016/j.eswa.2022.118850

Honeybees are the most specialized insect pollinators and are critical not only for honey production but, also, for keeping the environmental balance by pollinating the flowers of a wide variety of crops.

Recording and analyzing bee sounds became... Read More about Dynamic noise filtering for multi-class classification of beehive audio data.

Object Detection Using Sim2Real Domain Randomization for Robotic Applications (2022)
Journal Article
Horváth, D., Erdős, G., Istenes, Z., Horváth, T., & Földi, S. (2023). Object Detection Using Sim2Real Domain Randomization for Robotic Applications. IEEE Transactions on Robotics, 39(2), 1225-1243. https://doi.org/10.1109/tro.2022.3207619

Robots working in unstructured environments must be capable of sensing and interpreting their surroundings. One of the main obstacles of deep-learning-based models in the field of robotics is the lack of domain-specific labeled data for different ind... Read More about Object Detection Using Sim2Real Domain Randomization for Robotic Applications.

Multimodal Emotion Recognition from Art Using Sequential Co-Attention (2021)
Journal Article
Tashu, T. M., Hajiyeva, S., & Horvath, T. (2021). Multimodal Emotion Recognition from Art Using Sequential Co-Attention. Journal of Imaging, 7(8), Article 157. https://doi.org/10.3390/jimaging7080157

In this study, we present a multimodal emotion recognition architecture that uses both feature-level attention (sequential co-attention) and modality attention (weighted modality fusion) to classify emotion in art. The proposed architecture helps the... Read More about Multimodal Emotion Recognition from Art Using Sequential Co-Attention.

Swarm intelligence techniques in recommender systems - A review of recent research (2019)
Journal Article
Peška, L., Tashu, T. M., & Horváth, T. (2019). Swarm intelligence techniques in recommender systems - A review of recent research. Swarm and Evolutionary Computation, 48, 201-219. https://doi.org/10.1016/j.swevo.2019.04.003

One of the main current applications of Intelligent Systems are Recommender systems (RS). RS can help users to find relevant items in huge information spaces in a personalized way. Several techniques have been investigated for the development of RS.... Read More about Swarm intelligence techniques in recommender systems - A review of recent research.

Evolutionary computing in recommender systems: a review of recent research (2016)
Journal Article
Horváth, T., & de Carvalho, A. C. P. L. F. (2017). Evolutionary computing in recommender systems: a review of recent research. Natural Computing, 16(3), 441-462. https://doi.org/10.1007/s11047-016-9540-y

One of the main current applications of intelligent systems is recommender systems (RS). RS can help users to find relevant items in huge information spaces in a personalized way. Several techniques have been investigated for the development of RS. O... Read More about Evolutionary computing in recommender systems: a review of recent research.

Ranking Formal Concepts by Utilizing Matrix Factorization (2014)
Presentation / Conference Contribution
Pisková, L., Horvath, T., & Krajči, S. Ranking Formal Concepts by Utilizing Matrix Factorization. Presented at 12th International Conference on Formal Concept Analysis, Cluj-Napoca, Romania

Formal Concept Analysis often produce huge number of formal concepts even for small input data. Such a large amount of formal concepts, which is intractable to analyze for humans, calls for a kind of
a ranking of formal concepts according to their i... Read More about Ranking Formal Concepts by Utilizing Matrix Factorization.

Buried pipe localization using an iterative geometric clustering on GPR data (2013)
Journal Article
Janning, R., Busche, A., Horváth, T., & Schmidt-Thieme, L. (2014). Buried pipe localization using an iterative geometric clustering on GPR data. Artificial Intelligence Review, 42(3), 403-425. https://doi.org/10.1007/s10462-013-9410-2

Ground penetrating radar is a non-destructive method to scan the shallow subsurface for detecting buried objects like pipes, cables, ducts and sewers. Such buried objects cause hyperbola shaped reflections in the radargram images achieved by GPR. Ori... Read More about Buried pipe localization using an iterative geometric clustering on GPR data.

GRAMOFON: General model-selection framework based on networks (2011)
Journal Article
Buza, K., Nanopoulos, A., Horváth, T., & Schmidt-Thieme, L. (2012). GRAMOFON: General model-selection framework based on networks. Neurocomputing, 75(1), 163-170. https://doi.org/10.1016/j.neucom.2011.02.026

Ensembles constitute one of the most prominent class of hybrid prediction models. One basically assumes that different models compensate each other's errors if one combines them in an appropriate way. Often, a large number of various prediction model... Read More about GRAMOFON: General model-selection framework based on networks.

A Model of User Preference Learning for Content-Based Recommender Systems (2009)
Journal Article
Horvath, T. (2009). A Model of User Preference Learning for Content-Based Recommender Systems. Computing and Informatics, 28(4), 1001-1029

This paper focuses to a formal model of user preference learning for
content-based recommender systems. First, some fundamental and special requirements to user preference learning are identified and proposed. Three learning tasks are introduced as... Read More about A Model of User Preference Learning for Content-Based Recommender Systems.

User Preference Web Search -- Experiments with a System Connecting Web and User (2009)
Journal Article
Gurský, P., Horvath, T., Jirásek, J., Krajči, S., Novotny, R., Pribolová, J., Vaneková, V., & Vojtáš, P. (2009). User Preference Web Search -- Experiments with a System Connecting Web and User. Computing and Informatics, 28(4), 1001-1033

We present models, methods, implementations and experiments with a system enabling personalized web search for many users with different preferences. The system consists of a web information extraction part, a text search engine, a middleware support... Read More about User Preference Web Search -- Experiments with a System Connecting Web and User.

Knowledge Processing for Web Search – An Integrated Model and Experiments (2008)
Presentation / Conference Contribution
Gurský, P., Horvath, T., Jirásek, J., Novotny, R., Pribolová, J., Vaneková, V., & Vojtáš, P. Knowledge Processing for Web Search – An Integrated Model and Experiments. Presented at Symposium on Intelligent and Distributed Computing IDC, Craiova, Romania

We propose a model of a middleware system enabling personalized web
search for users with different preferences. We integrate both inductive and deductive tasks to find user preferences and consequently best objects. The model is based on modeling p... Read More about Knowledge Processing for Web Search – An Integrated Model and Experiments.

An ILP model for a monotone graded classification problem (2004)
Presentation / Conference Contribution
Vojtáš, P., Horvath, T., Krajči, S., & Lencses, R. An ILP model for a monotone graded classification problem. Presented at Znalosti 2003, Ostrava, Czech Republic

Motivation for this paper are classification problems in which data can not be clearly divided into positive and negative examples, especially data in which there is a monotone hierarchy (degree, preference) of more or less positive (negative) exampl... Read More about An ILP model for a monotone graded classification problem.

Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing (2019)
Presentation / Conference Contribution
Tashu, T. M., Szabó, D., & Horváth, T. (2019, June). Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing. Presented at ITS2019 Conference, Kingston, Jamaica

Automated essay evaluation systems use machine learning models to predict the score for an essay. For such, a training essay set is required which is usually created by human requiring time-consuming effort. Popular choice for scoring is a nearest ne... Read More about Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing.

Intelligent On-line Exam Management and Evaluation System (2019)
Presentation / Conference Contribution
Tashu, T. M., Esclamado, J. P., & Horvath, T. (2019, June). Intelligent On-line Exam Management and Evaluation System. Presented at 15th International Conference, ITS 2019, Kingston, Jamaica

Educational assessment plays a central role in the teaching-learning process as a tool for evaluating students’ knowledge of the concepts associated with the learning objectives. The evaluation and scoring of essay answers is a process, besides being... Read More about Intelligent On-line Exam Management and Evaluation System.

Solving Multi-class Imbalance Problems Using Improved Tabular GANs (2022)
Presentation / Conference Contribution
Farou, Z., Kopeikina, L., & Horváth, T. (2022, November). Solving Multi-class Imbalance Problems Using Improved Tabular GANs. Presented at 23rd International Conference on Intelligent Data Engineering and Automated Learning (IDEAL), Manchester

Multi-class imbalance problems are non-standard derivative data science problems. These problems are associated with the skewness in the data underlying distribution, which, in turn, raises numerous issues for conventional machine learning techniques... Read More about Solving Multi-class Imbalance Problems Using Improved Tabular GANs.

Attention-Based Multi-modal Emotion Recognition from Art (2021)
Presentation / Conference Contribution
Tashu, T. M., & Horváth, T. (2021, January). Attention-Based Multi-modal Emotion Recognition from Art. Presented at ICPR 2021, Online

Emotions are very important in dealing with human decisions, interactions, and cognitive processes. Art is an imaginative human creation that should be appreciated, thought-provoking, and elicits an emotional response. The automatic recognition of em... Read More about Attention-Based Multi-modal Emotion Recognition from Art.

Synonym-Based Essay Generation and Augmentation for Robust Automatic Essay Scoring (2022)
Presentation / Conference Contribution
Tashu, T. M., & Horváth, T. (2022, November). Synonym-Based Essay Generation and Augmentation for Robust Automatic Essay Scoring. Presented at 23rd International Conference on Intelligent Data Engineering and Automated Learning (IDEAL), Manchester

Automatic essay scoring (AES) models based on neural networks (NN) have had a lot of success. However, research has shown that NN-based AES models have robustness issues, such that the output of a model changes easily with small changes in the input.... Read More about Synonym-Based Essay Generation and Augmentation for Robust Automatic Essay Scoring.

A Novel Evaluation Metric for Synthetic Data Generation (2020)
Presentation / Conference Contribution
Galloni, A., Lendák, I., & Horváth, T. (2020, November). A Novel Evaluation Metric for Synthetic Data Generation. Presented at IDEAL 2020: 21st International Conference on Intelligent Data Engineering and Automated Learning, Guimarães, Portugal

Differentially private algorithmic synthetic data generation (SDG) solutions take input datasets Dp consisting of sensitive, private data and generate synthetic data Ds with similar qualities. The importance of such solutions is increasing both becau... Read More about A Novel Evaluation Metric for Synthetic Data Generation.

Linear Concept Approximation for Multilingual Document Recommendation (2021)
Book Chapter
Salamon, V. T., Tashu, T. M., & Horváth, T. (2021). Linear Concept Approximation for Multilingual Document Recommendation. . Springer. https://doi.org/10.1007/978-3-030-91608-4_15

In this paper, we proposed Linear Concept Approximation, a novel multilingual document representation approach for the task of multilingual document representation and recommendation. The main idea is in creating representations by using mappings to... Read More about Linear Concept Approximation for Multilingual Document Recommendation.

A Comparative Study of Assessment Metrics for Imbalanced Learning (2023)
Presentation / Conference Contribution
Farou, Z., Aharrat, M., & Horváth, T. (2023, September). A Comparative Study of Assessment Metrics for Imbalanced Learning. Presented at European Conference on Advances in Databases and Information Systems (ADBIS 2023), Barcelona, Spain

There are several machine learning algorithms addressing class imbalance problem, requiring standardized metrics for adequete performance evaluation. This paper reviews several metrics for imbalanced learning in binary and multi-class problems. We em... Read More about A Comparative Study of Assessment Metrics for Imbalanced Learning.

Directed Undersampling Using Active Learning for Particle Identification (2022)
Presentation / Conference Contribution
Farou, Z., Ouaari, S., Domian, B., & Horváth, T. (2021, May). Directed Undersampling Using Active Learning for Particle Identification. Presented at 4th International Conference on Recent Innovations in Computing (ICRIC-2021), Central University of Jammu

Time-Series in Hyper-parameter Initialization of Machine Learning Techniques (2021)
Presentation / Conference Contribution
Horváth, T., Mantovani, R. G., & de Carvalho, A. C. P. L. F. (2021, November). Time-Series in Hyper-parameter Initialization of Machine Learning Techniques. Presented at 22nd International Conference on Intelligent Data Engineering and Automated Learning (IDEAL2021), Manchester

Initializing the hyper-parameters (HPs) of machine learning (ML) techniques became an important step in the area of automated ML (AutoML). The main premise in HP initialization is that a HP setting that performs well for a certain dataset(s) will als... Read More about Time-Series in Hyper-parameter Initialization of Machine Learning Techniques.

Migrating Models: A Decentralized View on Federated Learning (2021)
Presentation / Conference Contribution
Kiss, P., & Horváth, T. (2021, September). Migrating Models: A Decentralized View on Federated Learning. Presented at ECML PKDD 2021, Online

Federated learning (FL) researches attempt to alleviate the increasing difficulty of training machine learning models, when the training data is generated in a massively distributed way. The key idea behind these methods is moving the training to loc... Read More about Migrating Models: A Decentralized View on Federated Learning.

Squared Symmetric Formal Contexts and Their Connections with Correlation Matrices (2023)
Presentation / Conference Contribution
Antoni, L., Eliaš, P., Horváth, T., Krajči, S., Krídlo, O., & Török, C. (2023, September). Squared Symmetric Formal Contexts and Their Connections with Correlation Matrices. Presented at International Conference on Conceptual Structures (ICCS) 2023, Berlin

Formal Concept Analysis identifies hidden patterns in data that can be presented to the user or the data analyst. We propose a method for analyzing the correlation matrices based on Formal concept analysis. In particular, we define a notion of square... Read More about Squared Symmetric Formal Contexts and Their Connections with Correlation Matrices.

Denoising Architecture for Unsupervised Anomaly Detection in Time-Series (2022)
Presentation / Conference Contribution
Skaf, W., & Horváth, T. (2022, September). Denoising Architecture for Unsupervised Anomaly Detection in Time-Series. Presented at ADBIS 2022: 26th European Conference on Advances in Databases and Information Systems, Turin, Italy

Anomalies in time-series provide insights of critical scenarios across a range of industries, from banking and aerospace to information technology, security, and medicine. However, identifying anomalies in time-series data is particularly challenging... Read More about Denoising Architecture for Unsupervised Anomaly Detection in Time-Series.

Data Generation Using Gene Expression Generator (2020)
Presentation / Conference Contribution
Farou, Z., Mouhoub, N., & Horváth, T. (2020, November). Data Generation Using Gene Expression Generator. Presented at IDEAL 2020: 21st International Conference on Intelligent Data Engineering and Automated Learning, Guimarães, Portugal

Generative adversarial networks (GANs) could be used efficiently for image and video generation when labeled training data is available in bulk. In general, building a good machine learning model requires a reasonable amount of labeled training data.... Read More about Data Generation Using Gene Expression Generator.

Integration of two fuzzy data mining methods (2004)
Journal Article
Horvath, T., & Krajči, S. (2004). Integration of two fuzzy data mining methods. Neural Network World, 14(5), 391-402

The cluster analysis and the formal concept analysis are both used to identify significiant groups of similar objects. Rice & Siff's algorithm for the clustering joins these two methods in the case where the values of an object-attribute model are 1... Read More about Integration of two fuzzy data mining methods.