gagolews/README.md

My current research interests are related to data science — with a focus on modelling complex phenomena, developing usable, general-purpose algorithms, studying their analytical properties, and finding out how people (laymen, decision makers, students, and researchers from different fields) use, misuse, understand, and misunderstand data analysis methods in scientific, business, political, social, and other settings. In my spare time, I write books for my students and develop open-source data analysis software.

Open-access textbooks

Software

Python packages

R packages

  • stringi – Fast and portable character string processing in R; one of the most frequently downloaded R packages (GitHub) (CRAN) (paper)
  • genieclust – Fast and robust hierarchical clustering with noise point detection (GitHub) (CRAN) (paper)
  • deadwood – Outlier detection via trimming of mutual reachability minimum spanning trees (GitHub) (CRAN)
  • quitefastmst – Euclidean and mutual reachability minimum spanning tree algorithms (GitHub) (CRAN)
  • stringx – Drop-in replacements for base R string functions powered by stringi (GitHub) (CRAN)
  • realtest – Where expectations meet reality: Realistic unit testing in R (GitHub) (CRAN)
  • TurtleGraphics – Learn computer programming in R while having a jolly time! (GitHub) (CRAN)

Data

Pinned

  1. deepr (Public)

    Deep R Programming (Open-Access Textbook)

    117 stars, 4 forks

  2. datawranglingpy (Public)

    Minimalist Data Wrangling with Python (Open-Access Textbook)

    86 stars, 4 forks

  3. stringi (Public)

    Fast and portable character string processing in R (with the Unicode ICU)

    C++, 318 stars, 47 forks

  4. genieclust (Public)

    Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R

    C++, 70 stars, 12 forks

  5. stringx (Public)

    Drop-in replacements for base R string functions powered by stringi

    HTML, 28 stars

  6. clustering-benchmarks (Public)

    A framework for benchmarking clustering algorithms

    Python, 44 stars, 8 forks