  • HTML 56.2%
  • Python 19.5%
  • JavaScript 11.5%
  • R 9.4%
  • CSS 1.7%
  • Other 1.7%
lambda Fix getStats Lambda timeout: replace transformToString with stream reader 2026-02-21 10:17:46 -05:00
mediapipe/face_mesh Add missing MediaPipe binary assets (wasm, data files) 2026-02-11 11:14:30 -05:00
upload_script Updated script instructions for new script. 2026-01-05 22:49:15 -05:00
analyze_eee.py moved legend 2026-04-29 00:57:55 -04:00
audio-recorder.js Show warning when eye tracking fails to initialize 2026-02-13 17:24:32 -05:00
baum_welch_hmm.py Add Baum-Welch HMM program for analyzing behavioral data 2026-01-14 21:03:30 -05:00
calibration.css Fix skip button not clickable due to WebGazer overlay z-index 2026-02-02 21:15:48 -05:00
calibration.html Use local webgazer.js instead of CDN to fix MediaPipe 404 errors 2026-02-11 11:07:00 -05:00
calibration.js Fix WebGazer 't is not a function' error during calibration 2026-02-11 11:00:12 -05:00
cogsci-logo.png Add logo to top left, move credit to top right 2026-02-09 11:30:20 -05:00
CogSciLogo.png crop logo 2026-02-18 03:10:39 -05:00
data_analysis.py Fix three participant tracker lookup failures dropping P009, P036, P045 2026-04-03 00:13:18 -04:00
data_viz.py Rewrite data_viz.py for speech-movement analysis 2026-01-20 02:28:38 -05:00
eee.py Add Flask server and wire up frontend integration in eee.py 2026-04-07 17:09:20 -04:00
file_name.py tweaks 2026-03-08 22:55:28 -04:00
formal_models.md Update formal_models.md 2026-04-30 15:24:47 -04:00
game.html Add EEE agent integration to game.html and game2new.html 2026-04-07 17:13:56 -04:00
game2new.html levels logging fix 2026-04-29 00:34:57 -04:00
game3.html Add completion overlay with confetti on final level 2026-02-13 18:42:32 -05:00
https_server.py Add modular audio recording with speech transcription 2025-11-25 12:12:11 +00:00
index.html grammar 2026-04-29 23:27:56 -04:00
levels.json Add ARC-style puzzle game with 6x6 grid 2025-10-17 14:32:38 -04:00
NLP_program.py Strengthen Establish/Exploit markers with weighted scoring, exclude P031 from transcripts 2026-04-02 21:08:53 -04:00
prepare_order_effects_data.py Add order effects analysis for counterbalancing validation 2026-01-20 12:19:52 -05:00
R_data_analysis.R Fix statistical analysis issues in R script 2026-01-20 13:21:40 -05:00
ra-portal.html Add line break in diction guideline 2026-02-22 23:23:06 -05:00
README.md Update README.md 2026-03-29 23:26:55 -04:00
replay.html X symbol 2026-05-04 15:07:34 -04:00
s3-uploader.js Add debug logging to S3Uploader upload flow 2026-02-11 17:59:22 -05:00
server.py Add Flask server and wire up frontend integration in eee.py 2026-04-07 17:09:20 -04:00
stats.html Centralize CORS config and add detailed session metrics to stats 2026-02-13 09:08:06 -05:00
stratified_sampling.py tightened RA speech filter 2026-04-01 22:40:57 -04:00
transcribe.py Add interpretation guide to data_analysis.py; update transcribe.py to single-file mode 2026-04-02 21:54:35 -04:00
webgazer.js Fix webgazer: replace truncated local copy and add MediaPipe assets 2026-02-11 11:09:30 -05:00

Research Study: Mechanisms of Causal Reasoning


By Rachel Papirmeister

Research conducted under the supervision of Professor Mark Santolucito, Ph.D.


Neurosymbolic approaches to artificial intelligence (AI) move beyond traditional deep learning by integrating neural networks with symbolic systems, improving the interpretability, robustness, and efficiency of AI. This hybrid paradigm enables models not only to recognize patterns but also to manipulate abstract rules and reason over novel combinations of learned concepts, more closely mirroring the flexibility of human cognition. Although neurosymbolic AI appears to outperform earlier machine learning approaches, and multi-level optimization and agentic reasoning paradigms have achieved remarkable performance across many domains, AI systems continue to struggle in areas where human cognition excels: causal inference, generalization from sparse data, and flexible problem-solving in novel environments.

To investigate these disparities, I developed three original, instructionless grid games with four to five levels each, in which participants solve puzzles through real-time hypothesis testing and environment-driven exploration. Multimodal data were collected as participants navigated the puzzles: verbal reasoning traces were elicited with a think-aloud protocol, while eye-tracking data, keystroke logs, timestamps, and game states were recorded automatically for each participant. Starting from a grounded theory analysis of behavioral observations, verbal learning markers were tracked across concurrent trajectories of mechanic-based and objective-based exploration, then classified into a taxonomy of knowledge acquisition using natural language processing for feature extraction. Mixed-effects regression was then used to test whether relative puzzle performance could be predicted from verbal reasoning traces.
Finally, the human behavioral data laid the foundation for a novel agentic architecture. The system integrates self-generated causal history with a Bayesian probabilistic model to govern LLM transitions across three states of causal inquiry via a deterministic finite automaton (DFA) controller. Preliminary findings support a proposed update to the traditional explore-exploit framework: a third, intermediary mechanism, establish, which encompasses the hypothesis-testing and confirmatory behaviors that bridge exploration and exploitation in human learning, specifically from sparse data. The puzzle performance of an LLM augmented with this architecture was compared with that of a baseline LLM; the augmented LLM showed improved learning and performance on pre-existing puzzle benchmarks, particularly in problem spaces with sparse reward. Full results will be reported in the final paper, anticipated May 2026.
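
The controller idea described above can be illustrated with a minimal sketch. It is not the implementation in `eee.py`. It assumes a Beta-Bernoulli belief as a stand-in for the Bayesian model, a three-symbol input alphabet obtained by thresholding that belief, and a hand-written transition table. All names (`BetaBelief`, `discretize`, `TRANSITIONS`, `run`) and the thresholds 0.4 and 0.7 are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class State(Enum):
    """The three states of causal inquiry named in the abstract."""
    EXPLORE = "explore"
    ESTABLISH = "establish"
    EXPLOIT = "exploit"


@dataclass
class BetaBelief:
    """Assumed stand-in for the Bayesian model: a Beta-Bernoulli belief
    that the agent's current causal hypothesis is correct."""
    alpha: float = 1.0  # pseudo-count of confirming observations
    beta: float = 1.0   # pseudo-count of disconfirming observations

    def update(self, confirmed: bool) -> None:
        if confirmed:
            self.alpha += 1.0
        else:
            self.beta += 1.0

    @property
    def mean(self) -> float:
        return self.alpha / (self.alpha + self.beta)


def discretize(belief: BetaBelief, low: float = 0.4, high: float = 0.7) -> str:
    """Map the posterior mean onto a finite alphabet (thresholds are assumed)."""
    if belief.mean >= high:
        return "high"
    if belief.mean <= low:
        return "low"
    return "mid"


# Deterministic transition function: exactly one next state for every
# (state, symbol) pair, which is what makes the controller a DFA.
TRANSITIONS = {
    (State.EXPLORE, "low"): State.EXPLORE,
    (State.EXPLORE, "mid"): State.ESTABLISH,
    (State.EXPLORE, "high"): State.ESTABLISH,
    (State.ESTABLISH, "low"): State.EXPLORE,
    (State.ESTABLISH, "mid"): State.ESTABLISH,
    (State.ESTABLISH, "high"): State.EXPLOIT,
    (State.EXPLOIT, "low"): State.EXPLORE,
    (State.EXPLOIT, "mid"): State.ESTABLISH,
    (State.EXPLOIT, "high"): State.EXPLOIT,
}


def run(observations, state=State.EXPLORE):
    """Feed confirm/disconfirm outcomes through the controller.

    Returns the visited states, the final belief, and the recorded
    history of (state, outcome) pairs.
    """
    belief = BetaBelief()
    history = []
    trace = [state]
    for confirmed in observations:
        history.append((state, confirmed))
        belief.update(confirmed)
        state = TRANSITIONS[(state, discretize(belief))]
        trace.append(state)
    return trace, belief, history


if __name__ == "__main__":
    trace, belief, _ = run([True, True, True, False])
    print([s.value for s in trace], round(belief.mean, 3))
```

In a full agent, the current state would presumably select which instruction or prompt the LLM receives on the next step. That wiring is omitted here because the source does not describe it.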


My broader objective is to apply this insight to the construction of domain-specific machine learning architectures specialized for few-shot learning and long-horizon planning, with implications for the design of human-centric neurosymbolic systems and the improvement of world models.

For research inquiries, please email r.papirmeister@columbia.edu.

To see the mechanics and win conditions for my puzzles, check out my Pseudo Code.