What’s new?

  • eRum2018: Europen R User Conference
    See you at eRum in Budapest where I’ll be giving the talk: Towards Native Declarative Data Collection with Question and Survey Instant Feedback in R&Shiny.

ODGAR Framework

for Online Declarative Data Gathering, Analyzing, and Reporting is coming…

Project homepage

Selected Publications

It is well documented that financial literacy is at best moderate around the world and that the cost of ignorance in this field may be high on both microeconomic and macroeconomic levels. We surveyed a representative sample of Poles to measure their debt literacy—a little‐studied aspect of financial literacy—and therefore obtain insight into the factors predicting it. Our study evidenced low levels of debt literacy and its overestimation by respondents in their self‐reports. We also confirmed some of the patterns found in former studies, including the gender gap and a positive relationship between the level of educational attainment and debt literacy. Finally, our examination provides compelling outcomes with regard to the segmentation of the sample on the basis of objective and subjective debt literacy scores. They show large heterogeneity of debt literacy and thus confirm the need for far‐reaching customization of debt‐oriented education.
The Journal of Consumer Affairs, 2018

Although the percentage of foreign students in Poland has increased over 9 times over past 10 years, it is still well below European Union average. We have been looking for determinants of willingness to study in Poland among members of Polish Diaspora, who already have ties with the country. We created empirical models of willingness of people with Polish origin to study abroad, in Poland, and in a peripheral Polish academic centre in particular. Such models can help policy-makers and universities to meet the needs of international students and make universities more competitive on the global higher education market.
International Journal of Educational Development, 2017

In recent years, there has been increased interest in methods for gender prediction based on first names that employ various open data sources. These methods have applications from bibliometric studies to customizing commercial offers for web users. Analysis of gender disparities in science based on such methods are published in the most prestigious journals, although they could be improved by choosing the most suited prediction method with optimal parameters and performing validation studies using the best data source for a given purpose.
In The R Journal, 2016

Recent Publications

(2018). Debt Literacy and Debt Literacy Self‐Assessment: The Case of Poland. The Journal of Consumer Affairs.

WWW Lista A: 30pkt.

(2017). Determinants of willingness to study in Poland among members of Polish Diaspora. International Journal of Educational Development.

PDF Project WWW Lista A: 25pkt.

(2017). Środowisko fizyczne i społeczne oraz jakość funkcjonowania szkoły a zdrowie subiektywne i zachowania zdrowotne nastolatków. Raport końcowy z realizacji projektu badawczego. Instytut Matki i Dziecka, Warszawa 2017.


(2017). Personal Loan Companies in Poland: Does Empirical Evidence Justify Regulatory Transition?. Prague Economic Papers 2017.

PDF WWW Lista A: 20pkt.

(2016). Loan perception and loan-related attitude of Poles: evidence from the field. Forum Scientiae Oeconomia Volume 4 (2016), No. 2.

PDF Lista B: 7pkt.

(2015). Sektor shadow banking w Polsce. Wydawnictwo CeDeWu.


(2007). Action for Growth. A resource to enable Scout Associations to develop a growth strategy. World Scout Bureau. European Regional Office on behalf of the European Scout Committee, Geneva 2007.


Recent Talks

& Posters

R is great for data analysis and Shiny is great for interactive data visualisation, but could we use R&Shiny for efficient declarative data collection? Moreover, how can we develop web data products in R&Shiny, that are based on real-time declarative data collection with after-question and after-survey instant feedback? Users of such web data products should be able to immediately access the feedback relevant to their answers. To increase the value of the feedback, it should be dynamically customised to each respondent. This can be achieved by pre-programmed templates of feedback scenarios, which can be adaptively customised by the respondent’s answers to this or previous questions. Employing large analytical and data visualisation capabilities in R, we could try to adapt any type of instant feedback to each user. Using R, we could also combine different feedback sources: a respondent’s answers to a given question and to other questions, other users’ answers, external open data (imported into our app or available via APIs), and aggregated or summarised outcomes from reference studies. What are the possibilities and obstacles for developing such data products natively in R&Shiny? How the idea of QAF (Question, Answer, and Feedback) objects can be implemented in R&Shiny? What is the roadmap for developing ODGAR framework for On-line Data Gathering, Analysing, and Reporting? Is it possible to build mobile app in R&Shiny? I will try to answer these questions using experience gained from developing early stage prototypes.

There is an urgent need for new declarative data that can help solve important social problems. However, such data are more and more often difficult to obtain even if the research project is non-for-profit and aims at solving some social problem of great importance. The main reason for this situation is the perseverance of traditional model of respondent-researcher relationship. This model is harmful to social science research in general, and often under-founded socially important research projects in particular. Additionally, traditional on-line research techniques which collect declarative data are obsolete. They do not fully take advantage of Internet technologies and specificity of the needs of Internet users. In order to advance declarative data collection for social good, we need to implement new model of long-term respondent-researcher relationship. In this model there is a need for close collaboration between social scientist, programmers and data scientists. This collaboration is necessary for the transformation of old social science research techniques into modern on-line data products for collecting declarative data and providing instant customized feedback for the respondents. The main goal of these new tools is to support stable on-line panels of respondents willing to participate in important social research projects in exchange for valuable content provided instantly by data scientists via the same research tool.

The global population of researchers, data scientists, and analysts from academia and private sector is hard to reach for quick and cost-effective survey needs. At the same time, quantified opinions of such experts are a valuable help for decision-making, public policies, and meta analyses of (open) science development. In the forthcoming age of the Open Science there is a strong need for tools and methods that would allow for quick and easy access to members of the scientific community for research purposes.

Recent Posts

More Posts

The second eRum was organized this year in Budapest (Hungary) and gathered ~500 participants (mostly, but not only, from Europe). It was a great event and a worthy successor of the first eRum organized in Poznań (Poland) two years ago. In the Workskop Day I participated in two workshops: Efficient R programming by Colin Gillespie (author of the Efficient R Programming book by O’Reilly) Building a package that lasts by Colin Fay.


Center for Research Computing (CRC) Od 1 sierpnia do 31 października 2017 r. miałem możliwość i ogromną satysfakcję gościć na Uniwersytecie Notre Dame w stanie Indiana. Pojechałem tam zaproszony przez dyrektora Center for Research Computing (CRC) prof. Jarka Nabrzyskiego, którego poznałem w Tokio rok wcześniej na konferencji plenarnej Research Data Alliance. W USA spędziłem trzy miesiące jako wizytujący assistant professor, przyglądając się pracy CRC, które jest jednym z kilku wydzielonych Centrów, działających na Uniwersytecie Notre Dame.


During my research visit at Notre Dame University I had the pleasure to participate in Hadley Wickham’s lecture Welcome to the Tidyverse and meet Hadley in person. Hadley’s talks are always well-structured and worth listening. Hadley Wickham has been a prime mover in releasing R upon the masses, enabling hordes of unsuspecting would-be researchers to process and visualize data in ways they never dreamed of. The tidyverse, the culmination of years of effort in the R language, is a universe of packages that facilitate a grammar of data, graphics, and modeling that allows even beginners to speak the language of data science fluently.


W sobotę (24.06) odbyło się na WSIiZ insprujące seminarium z udziałem on-line prof. P.Zimbardo. Philip Zimbardo był w 1971 roku liderem sławnego Stanfordzkiego Eksperymentu Więziennego i świadkiem-ekspertem w sprawie więzienia Abu Ghraib. Autor wpływowych książek: The Time Paradox: The New Psychology of Time That Will Change Your Life i The Lucifer Effect: Understanding How Good People Turn Evil. Warto zapoznać się z projektem Profesora Zimbardo: Heroic Imagination Project.


Here is a selection of my Data Science and Programming skills and tools that turned out to be helpful in my work and I believe are important for any Data Scientist as well. The last update of this post was done: 2018-05-23. Data Science Skills Data Manipulation Efficient data manipulation in R; dplyr, data.table, reshape2. Working with dates and time-series; lubridate, xts. Reproducible Analyses RStudio IDE, Markdown, LaTeX;



Research & Software

Closed Open Data

Barriers & bad practices in accessing Open Data. Case study of Poland.


The bridge between LimeSurvey and R.


The R package for gender prediction based on first names.

Resovia R User Group (RRUG)

R User Group in Rzeszów


Projekt badawczy poświęcony Polonii i Polakom poza granicami kraju.


Workshops & Lectures

  • Academic Teaching
    • Statistics
    • Inferential Statistics
    • Analysis and Visualisation of Economic Data
  • Workshops & Short Courses (including PhD level)
    • Introduction to R
    • Programming in R
    • Data Science in R
    • Developing Web Data Products in R & Shiny

Hire me!

What can I do for you?

  • Collaboration in research projects (non-profit & commercial)
    • reproducible data analysis
    • machine learning & statistical analysis
    • scripting for on-line surveys data collection
    • Computer-Aided Web Interviewing (CAWI)
    • interactive web reporting of research data
    • co-authoring research papers
  • R packages development
    • GitHub & CRAN deployment
  • Web Data Products development
    • reactive programming in R & Shiny
    • prototyping and expanding Shiny application
  • Consultations
    • methodology in Social Science
    • data collection
    • statistical analysis
    • data vizualization

Feel free to send me an e-mail if you have any questions!


See also

My Recommendations

Roche University of Notre Dame Research Data Alliance Open Data Institute ODI Agent for High-End IT Professionals