- HTML 56.2%
- Python 19.5%
- JavaScript 11.5%
- R 9.4%
- CSS 1.7%
- Other 1.7%
| lambda | ||
| mediapipe/face_mesh | ||
| upload_script | ||
| analyze_eee.py | ||
| audio-recorder.js | ||
| baum_welch_hmm.py | ||
| calibration.css | ||
| calibration.html | ||
| calibration.js | ||
| cogsci-logo.png | ||
| CogSciLogo.png | ||
| data_analysis.py | ||
| data_viz.py | ||
| eee.py | ||
| file_name.py | ||
| formal_models.md | ||
| game.html | ||
| game2new.html | ||
| game3.html | ||
| https_server.py | ||
| index.html | ||
| levels.json | ||
| NLP_program.py | ||
| prepare_order_effects_data.py | ||
| R_data_analysis.R | ||
| ra-portal.html | ||
| README.md | ||
| replay.html | ||
| s3-uploader.js | ||
| server.py | ||
| stats.html | ||
| stratified_sampling.py | ||
| transcribe.py | ||
| webgazer.js | ||
Research Study: Mechanisms of Causal Reasoning
By Rachel Papirmeister
Research conducted under the supervision of Professor Mark Santolucito, Ph.D.
Neurosymbolic approaches to artificial intelligence (AI) move beyond traditional deep learning by integrating neural networks and symbolic systems, yielding enhancements in the interpretability, robustness, and efficiency of AI. This hybrid paradigm enables models to not only recognize patterns, but also manipulate abstract rules and reason over novel combinations of learned concepts, more closely mirroring the flexibility of human cognition. Although neurosymbolic AI appears to outperform earlier machine learning approaches, and multi-level optimization and agentic reasoning paradigms have achieved remarkable performance across many domains, AI systems overall continue to struggle in areas where human cognition excels: causal inference tasks, generalization from sparse data, and flexible problem-solving in novel environments. To investigate these disparities, I developed three original, instructionless grid-games with four to five levels each, wherein participants solve puzzles through real-time hypothesis testing and environment-driven exploration. Multimodal data was collected while participants navigated the puzzles; their verbal reasoning traces were elicited using a think-aloud protocol while quantitative data was automatically obtained, including eye-tracking, keystroke logs, timestamps, and game states of each participant. Beginning from grounded theory analysis of behavioral observations, verbal learning markers were tracked across concurrent trajectories of mechanic-based and objective-based exploration, and then classified into a taxonomy of knowledge acquisition using natural language processing for feature extraction. Subsequent mixed-effects regression analysis was used to determine whether relative puzzle performance could be predicted from verbal reasoning traces. Finally, human behavioral data laid the foundation for a novel agentic architecture: the system integrates self-generated causal history with a Bayesian probabilistic model to govern LLM transitions across three states of causal inquiry via a deterministic finite automaton (DFA) controller. Preliminary findings support a proposed update to the traditional explore-exploit framework by introducing a third, intermediary mechanism, ‘establish’, which encompasses the hypothesis-testing and confirmatory behaviors that bridge exploration and exploitation in human learning specifically from sparse data. The puzzle performance of an LLM augmented with this novel architecture was compared to performance of an LLM at baseline; the augmented LLM showed improved learning and performance on pre-existing puzzle benchmarks, particularly in problem spaces with sparse reward. Full results will be reported in the final paper, anticipated May 2026.
My broader objective is to leverage this insight in constructing domain-specific machine learning architectures specialized for few-shot learning and long-horizon planning, with implications for the design of human-centric, neurosymbolic systems and improving world models.