Open-source software

[1] P. Agarwal, P. Balaprakash, S. Leyffer, and S. M. Wild. SPUDS: Smart Pipeline for Urban Data Science, 2017. SPUDS is a machine-learning pipeline to build classification models to rank food establishments that are at most risk for the types of violations most likely to spread food-borne illness. The pipeline balances the the training data with resampling techniques, identifies the most important factors leading to critical violations via variable importance and variable selection methods, evaluates several state-of-the-art learning with cross validation, and finally combines the best performing ones via bagging. It is a customized version of AutoMOMML, exclusively designed for City of Chicago Smart Data Platform, and implemented in Python. [ bib | http ]
[2] P. Balaprakash, A. Tiwari, S. M. Wild, L. Carrington, and P. D. Hovland. AutoMOMML: Automatic Multi-objective Modeling with Machine Learning, 2016. AutoMOMML is an end-to-end, machine-learning-based framework to build predictive models for objectives such as performance, and power. The framework adopts statistical approaches to reduce the modeling complexity and automatically identifies and configures the most suitable learning algorithm to model the required objectives based on hardware and application signatures. [ bib | http ]
[3] P. Balaprakash, A. Mametjanov, C. Choudary, P. D. Hovland, S. M. Wild, and G. Sabin. FPGAtuner: Autotuning FPGA Design Parameters for Performance and Power, 2016. A machine-learning-based approach to tune FPGA design parameters. It performs sampling-based reduction of the parameter space and guides the search toward promising parameter configurations. [ bib | http ]
[4] P. Balaprakash. SuRF: Search using Random Forest, 2011. SuRF is a model-based search module for automatic performance tuning. It adopts random forest supervised learning algorithm for modeling the performances as a function of input parameters within the search. SuRF samples a small number of parameter configurations, empirically evaluating the corresponding code variants to obtain the corresponding performance metrics, and fitting a surrogate model over the input-output space. The surrogate model is then iteratively refined by obtaining new output metrics at unevaluated input configurations predicted to be high-performing by the model. Implemented in Python and available with the Orio autotuning framework. [ bib | http ]
[5] P. Balaprakash, S. M. Wild, and B. Norris. SPAPT: Search Problems in Automatic Performance Tuning, 2011. A set of extensible and portable search problems in automatic performance tuning whose goal is to aid in the development and improvement of search strategies and performance-improving transformations. SPAPT contains representative implementations from a number of lower-level, serial performance-tuning tasks in scientific applications. Available with the Orio autotuning framework. [ bib | http ]
[6] M. L. Ibanez, J. D. Lacoste, T. Stützle, M. Birattari, E. Yuan, and P. Balaprakash. The irace Package: Iterated Race for Automatic Algorithm Configuration, 2010. The irace package implements the iterated racing procedure, which is an extension of the Iterated F-race procedure. Its main purpose is to automatically configure optimization algorithms by finding the most appropriate settings given a set of instances of an optimization problem. It builds upon the race package by Birattari, and it is implemented in R. [ bib | http ]
[7] P. Balaprakash, M. Birattari, and T. Stützle. ELS-PTSP: Estimation-based Local Search for the Probabilistic Traveling Salesman Problem, 2009. This software package provides a high-performance implementation of the estimation-based iterative improvement algorithm to tackle the probabilistic traveling salesman problem. A key novelty of the proposed algorithm is that the cost difference between two neighbor solutions is estimated by partial evaluation, adaptive, and importance sampling. Developed in C with GNU scientific library under Linux. [ bib | http ]