{"id":186582,"date":"2025-05-05T10:22:41","date_gmt":"2025-05-05T14:22:41","guid":{"rendered":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/?p=186582"},"modified":"2026-05-04T10:43:47","modified_gmt":"2026-05-04T14:43:47","slug":"paychex-2","status":"publish","type":"post","link":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/","title":{"rendered":"Paychex"},"content":{"rendered":"\n<p><strong>Project Title:<\/strong>&nbsp;An AI-Powered Regulatory Chatbot for Labor Legislation<br><strong>Team Members<\/strong>: Ethan Leung, Carol Li, Astha Singh, Keming Zhang<br><strong>Sponsor:<\/strong>&nbsp;Paychex<br><strong>Affiliation<\/strong>: Goergen Institute for Data Science, University of Rochester<\/p>\n\n\n<h2>INTRODUCTION<\/h2>\n\n\n<p><span style=\"font-weight: 400;\">In today\u2019s fast-evolving regulatory landscape, businesses must constantly adapt to changes in federal and state legislation to ensure compliance in areas such as payroll, taxation, and human resources. Paychex, a leading provider of integrated HR solutions, faces the challenge of maintaining up-to-date regulatory information to effectively support over 740,000 clients. This capstone project focused on an <\/span><b>AI-powered chatbot<\/b><span style=\"font-weight: 400;\"> designed to assist Paychex in <\/span><b>delivering accurate and timely regulatory guidance<\/b><span style=\"font-weight: 400;\">.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture1.png\"><img loading=\"lazy\" decoding=\"async\" width=\"518\" height=\"292\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture1.png\" alt=\"\" class=\"wp-image-204012\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture1.png 518w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture1-300x169.png 300w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture1-340x191.png 340w\" sizes=\"auto, (max-width: 518px) 100vw, 518px\" \/><\/a><\/figure>\n\n\n\n<p><a href=\"https:\/\/news.cornell.edu\/stories\/2024\/07\/research-decision-making-mystery-ai-chatbots\">https:\/\/news.cornell.edu\/stories\/2024\/07\/research-decision-making-mystery-ai-chatbots<\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">OBJECTIVES<\/h2>\n\n\n\n<p>Our work culminates in a scalable, automated system that improves access to complex legal information, supporting efficient and dependable client service. By combining advanced AI methods with robust data management, the system aims to provide Paychex with reliable, up-to-date information, which enhances operational efficiency and ensures adherence to complex regulatory requirements.<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Analyze and prepare data for model integration<\/li>\n\n\n\n<li>Build and optimize the vector database<\/li>\n\n\n\n<li>Develop the retrieval-augmented generation agent graph<\/li>\n\n\n\n<li>Integrate the chatbot with data and graph functions<\/li>\n<\/ol>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>DATA DESCRIPTION<\/strong><\/h2>\n\n\n\n<p>The original dataset comprises regulatory information related to <strong>labor laws at both the federal and state levels<\/strong>. The dataset initially contained <strong>1,124 entries<\/strong>, with each row representing a distinct regulation, accompanied by a brief description and a hyperlink to the official source. One entry was removed due to a missing URL, resulting in a final dataset of 1,123 complete records for analysis.<\/p>\n\n\n\n<p>There are three types of data sources\u2014<strong>Web Resource, PDF, and HTML<\/strong>\u2014each scraped using distinct techniques. A total of <strong>1036 links were successfully scraped<\/strong>, while 87 errors remain. All extracted text was compiled into the dataset.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"633\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-1024x633.png\" alt=\"\" class=\"wp-image-204062\" style=\"width:633px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-1024x633.png 1024w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-300x186.png 300w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-768x475.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-1536x950.png 1536w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-2048x1267.png 2048w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-4-1920x1187.png 1920w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-text-align-center\"><em>Figure 1: Distribution of File Types<\/em><\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"610\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-1024x610.png\" alt=\"\" class=\"wp-image-204072\" style=\"width:673px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-1024x610.png 1024w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-300x179.png 300w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-768x457.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-1536x915.png 1536w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-2048x1220.png 2048w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-3-1920x1143.png 1920w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center\" style=\"text-align: center;\"><em>Figure 2: Distribution of Errors<\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>METHODS<\/strong><\/h2>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Feature Engineering<\/strong><\/h3>\n\n\n\n<p>Procedure based on the Description column:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Added \u201c<strong>Category<\/strong>\u201d column based on keywords in the description: paid_sick_leave, childcare_regulations, wage_minimum_industry, holiday_pay_provisions\uff0cwork_hour_regulations, employment_statutes, fair_labor_standards\uff0cemployee_leave_benefits.<\/li>\n\n\n\n<li>Added a <strong>Jurisdiction<\/strong> column to categorize each row as either Federal or State.<\/li>\n\n\n\n<li>Added <strong>State and State Code<\/strong> columns to classify all State-level regulations according to the 50 U.S. states.<\/li>\n<\/ul>\n\n\n\n<p>Among the eight categorized themes, <em>paid_sick_leave<\/em> accounts for the highest number of regulations (255), followed by <em>childcare_regulations<\/em> (238) and <em>wage_minimum_industry<\/em> (216). In contrast, categories such as <em>fair_labor_standards<\/em> (64) and <em>employee_leave_benefits<\/em> (44) are represented less frequently. This distribution highlights the relative prominence of certain labor regulation themes, particularly those related to paid leave and childcare, within the dataset.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"946\" height=\"630\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture2-1.png\" alt=\"\" class=\"wp-image-204142\" style=\"width:605px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture2-1.png 946w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture2-1-300x200.png 300w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture2-1-768x511.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture2-1-660x440.png 660w\" sizes=\"auto, (max-width: 946px) 100vw, 946px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-text-align-center\"><em>Figure 3: Distribution of Regulation Categories<\/em><\/p>\n\n\n\n<p>Another feature added during the engineering process was the <em>Jurisdiction<\/em> column, which classifies each regulation as either Federal or State in scope. A figure was created to show the distribution of Jurisdiction. We can observe that there are 911 state-related regulations and 212 federal-level regulations. The number of state regulations is highly outweighs the number of federal laws.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"632\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-1024x632.png\" alt=\"\" class=\"wp-image-204202\" style=\"width:598px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-1024x632.png 1024w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-300x185.png 300w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-768x474.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-1536x948.png 1536w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-2048x1264.png 2048w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-1-1920x1185.png 1920w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-text-align-center\"><em>Figure 4: Distribution of Jurisdiction<\/em><\/p>\n\n\n\n<p>Furthermore, two supporting columns\u2014<em>State<\/em> and <em>State Code<\/em>\u2014were introduced to facilitate the classification of all state-level regulations according to the 50 U.S. states. Analysis of these columns reveals that the majority of state regulations in the dataset originate from a subset of states, including Maryland (MD), Minnesota (MN), New York (NY), California (CA), Montana (MT), Arizona (AZ), New Jersey (NJ), Massachusetts (MA), New Mexico (NM), and Maine (ME). This distribution may reflect differences in regulatory activity or data availability across states.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"806\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-2-1024x806.png\" alt=\"\" class=\"wp-image-204232\" style=\"width:559px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-2-1024x806.png 1024w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-2-300x236.png 300w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-2-768x604.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-2-1536x1209.png 1536w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-2.png 1915w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-text-align-center\"><em>Figure 5: Distribution of Number of Laws by State<\/em><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>FAISS Vector Database<\/strong><\/h3>\n\n\n\n<p>A brief steps as follows:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Database&nbsp; 2. Embedding &nbsp; 3. FAISS Index&nbsp; 4. Indexing &nbsp; 5. TF-IDF&nbsp; 6. Searching<\/li>\n<\/ol>\n\n\n\n<p>We used FAISS (Facebook AI Similarity Search) to build a fast, efficient search system that matches user queries to relevant labor laws. By converting legal text into vector embeddings using models like BERT, the chatbot can quickly find semantically similar content. We also incorporated TF-IDF scoring to enhance result accuracy and relevance.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Hybrid Search<\/strong><\/h3>\n\n\n\n<p>To improve precision, we combined semantic vector search with traditional keyword matching in a hybrid approach. This ensures that the chatbot not only understands the meaning behind a query but also captures important legal terms. By ranking results based on both methods, the chatbot delivers more relevant and legally accurate answers.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>LangGraph<\/strong><\/h3>\n\n\n\n<p>We used LangGraph to manage the chatbot&#8217;s multi-step reasoning, including searching, filtering, and generating responses. This structure allows the system to fall back on a controlled Google Search if no internal data matches a query, ensuring reliability. Prompts guide each AI agent\u2019s role, enabling smooth and accurate interactions.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>RESULTS<\/strong><\/h2>\n\n\n\n<p>The project resulted in the creation of three models that improved upon each other. The workflow of the chatbot is pictured below, along with a model comparison of each model\u2019s performance in a few common scenarios. Observably, the first version was the fastest but gave too succinct responses. The second sacrificed speed for answer quality, but fell short in hyper-specific queries. The final sees the same speed decrease, but it works well in all scenarios.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"928\" height=\"1024\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-928x1024.png\" alt=\"\" class=\"wp-image-204352\" style=\"width:574px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-928x1024.png 928w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-272x300.png 272w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-768x848.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-1392x1536.png 1392w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-1856x2048.png 1856w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-6-1920x2119.png 1920w\" sizes=\"auto, (max-width: 928px) 100vw, 928px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-text-align-center\"><em>Figure 6: Model Comparison of Each Model\u2019s Performance<\/em><\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"588\" height=\"1024\" src=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5-588x1024.png\" alt=\"\" class=\"wp-image-204342\" style=\"width:482px;height:auto\" srcset=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5-588x1024.png 588w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5-172x300.png 172w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5-768x1337.png 768w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5-882x1536.png 882w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5-1177x2048.png 1177w, https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/Picture3.png-5.png 1627w\" sizes=\"auto, (max-width: 588px) 100vw, 588px\" \/><\/figure>\n\n\n\n<p class=\"has-text-align-center has-text-align-center\"><em>Figure 7: LangGraph Workflow Version 3<\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>DEMO<\/strong><\/h2>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"wp-embed\"><div class=\"wp-embed-wrap\"><iframe loading=\"lazy\" title=\"paychex demo\" width=\"1062\" height=\"597\" src=\"https:\/\/www.youtube.com\/embed\/2pJVTn4eS7s?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/div><\/div>\n<\/div><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>CONCLUSION &amp; NEXT STEPS<\/strong><\/h2>\n\n\n\n<p>As a whole, the project opens avenues of usage for similar use cases in other fields to create specialized chatbots to aid in specific research. A major area of improvement for a project like this could be to refine the Google Search fallback to be used more prominently, as it would reduce the need to manage the initial knowledge base.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>ACKNOWLEDGEMENT<\/strong><\/h2>\n\n\n\n<p>We would like to thank Professors Anand and Caliskan for their continuous support and mentorship, as well as Ledion Lico, Daniel Riggi, Ravi Dugh, Michelle Li, Lilly Xie, and Jing Zhu at Paychex for their guidance throughout the project.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>REFERENCES<\/strong><\/h2>\n\n\n\n<p>1.What is Rag? &#8211; retrieval-augmented generation AI explained &#8211; AWS. (n.d.). https:\/\/aws.amazon.com\/what-is\/retrieval-augmented-generation\/\u00a0<\/p>\n\n\n\n<p>2.Roller, J. (2024, May 15). <em>Understanding vector databases: The foundation of modern AI applications<\/em>. IEEE Computer Society. https:\/\/www.computer.org\/publications\/tech-news\/community-voices\/vector-databases-and-ai-applications\/&nbsp;<\/p>\n\n\n\n<p>3.Web scraping: Jurisprudence and legal doctrines &#8211; fontana &#8211; 2025 &#8211; the journal of world intellectual property &#8211; wiley online library. (n.d.-a). https:\/\/onlinelibrary.wiley.com\/doi\/10.1111\/jwip.12331?af=R<\/p>\n\n\n\n<p>4.Vine, J. (n.d.). <em>pdfplumber<\/em> [Computer software]. GitHub. <a href=\"https:\/\/github.com\/jsvine\/pdfplumber\">https:\/\/github.com\/jsvine\/pdfplumber<\/a><\/p>\n\n\n\n<p>5.Richardson, L. (2007). <em>Beautiful Soup Documentation<\/em>. Crummy.com. <a href=\"https:\/\/www.crummy.com\/software\/BeautifulSoup\/\">https:\/\/www.crummy.com\/software\/BeautifulSoup\/<\/a><\/p>\n\n\n\n<p>6.SeleniumHQ. (n.d.). <em>Selenium WebDriver<\/em>. https:\/\/www.selenium.dev\/documentation\/webdriver\/<\/p>\n\n\n\n<p>7.Reitz, K. (n.d.). <em>Requests: HTTP for Humans<\/em>. Python-Requests.org. https:\/\/docs.python-requests.org\/<\/p>\n\n\n\n<p>8.<em>Faiss<\/em><em> | \ud83e\udd9c\ufe0f\ud83d\udd17 <\/em><em>LangChain<\/em>. (2024). Langchain.com. https:\/\/python.langchain.com\/docs\/integrations\/vectorstores\/faiss\/<\/p>\n\n\n\n<p>9.Reimers, N., &amp; Gurevych, I. (2019, August 27). <em>Sentence-<\/em><em>bert<\/em><em>: Sentence embeddings using Siamese Bert-Networks<\/em>. arXiv.org. https:\/\/arxiv.org\/abs\/1908.10084<\/p>\n\n\n\n<p>10.Johnson, J., Douze, M., &amp; J\u00e9gou, H. (2017, February 28). <em>Billion-scale similarity search with <\/em><em>gpus<\/em>. arXiv.org. https:\/\/arxiv.org\/abs\/1702.08734?context=cs<\/p>\n\n\n\n<p>11.Gao, L., Dai, Z., &amp; Callan, J. (2021, April 15). <em>Coil: Revisit exact lexical match in information retrieval with contextualized inverted list<\/em>. arXiv.org. https:\/\/arxiv.org\/abs\/2104.07186<\/p>\n\n\n\n<p>12.Cohorte. (2024, November 15). <em>Unleashing the Power of <\/em><em>LangGraph<\/em><em>: An Introduction to the Future of AI Workflows<\/em>. Cohorte.co; Cohorte. https:\/\/www.cohorte.co\/blog\/unleashing-the-power-of-langgraph-an-introduction-to-the-future-of-ai-workflows<\/p>\n\n\n\n<p>&nbsp;<\/p>\n\n\n\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Project Title:&nbsp;An AI-Powered Regulatory Chatbot for Labor LegislationTeam Members: Ethan Leung, Carol Li, Astha Singh, Keming ZhangSponsor:&nbsp;PaychexAffiliation: Goergen Institute for Data Science, University of Rochester INTRODUCTION In today\u2019s fast-evolving regulatory&hellip;<\/p>\n","protected":false},"author":15722,"featured_media":204672,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_coblocks_attr":"","_coblocks_dimensions":"","_coblocks_responsive_height":"","_coblocks_accordion_ie_support":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[4442,16042,96],"tags":[16522,19772],"coauthors":[20352,20362,20372,20202],"class_list":["post-186582","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-archive","category-artificial-intelligence","category-csc-archive","tag-data-science","tag-embedded-ai"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Paychex - Senior Design Day<\/title>\n<meta name=\"description\" content=\"We engineered a custom dataset of over 1,100 legal regulations, developed a vector search system using FAISS, and implemented Google Gemini\u2019s language model to optimize speed and accuracy. This tool is a scalable solution designed to improve legal compliance support and could be adapted to other industries that require precise regulatory information.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Paychex - Senior Design Day\" \/>\n<meta property=\"og:description\" content=\"We engineered a custom dataset of over 1,100 legal regulations, developed a vector search system using FAISS, and implemented Google Gemini\u2019s language model to optimize speed and accuracy. This tool is a scalable solution designed to improve legal compliance support and could be adapted to other industries that require precise regulatory information.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Senior Design Day\" \/>\n<meta property=\"article:published_time\" content=\"2025-05-05T14:22:41+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-05-04T14:43:47+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/logo-1200x424.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"424\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Ethan Leung, Carol Li, Astha Singh, Keming Zhang\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ethan Leung, Carol Li, Astha Singh, Keming Zhang\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/\"},\"author\":{\"name\":\"Ethan Leung\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/#\\\/schema\\\/person\\\/f806009484b11725d3e68e9cc939c980\"},\"headline\":\"Paychex\",\"datePublished\":\"2025-05-05T14:22:41+00:00\",\"dateModified\":\"2026-05-04T14:43:47+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/\"},\"wordCount\":1186,\"image\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/wp-content\\\/uploads\\\/2025\\\/05\\\/logo.png\",\"keywords\":[\"Data Science\",\"Embedded AI\"],\"articleSection\":[\"3. Programs Archive\",\"Artificial Intelligence\",\"CSC Archive\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/\",\"url\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/\",\"name\":\"Paychex - Senior Design Day\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/wp-content\\\/uploads\\\/2025\\\/05\\\/logo.png\",\"datePublished\":\"2025-05-05T14:22:41+00:00\",\"dateModified\":\"2026-05-04T14:43:47+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/#\\\/schema\\\/person\\\/f806009484b11725d3e68e9cc939c980\"},\"description\":\"We engineered a custom dataset of over 1,100 legal regulations, developed a vector search system using FAISS, and implemented Google Gemini\u2019s language model to optimize speed and accuracy. This tool is a scalable solution designed to improve legal compliance support and could be adapted to other industries that require precise regulatory information.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/wp-content\\\/uploads\\\/2025\\\/05\\\/logo.png\",\"contentUrl\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/wp-content\\\/uploads\\\/2025\\\/05\\\/logo.png\",\"width\":2560,\"height\":424},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/paychex-2\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Paychex\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/#website\",\"url\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/\",\"name\":\"Senior Design Day\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/#\\\/schema\\\/person\\\/f806009484b11725d3e68e9cc939c980\",\"name\":\"Ethan Leung\",\"url\":\"https:\\\/\\\/www.hajim.rochester.edu\\\/senior-design-day\\\/author\\\/eleung6\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Paychex - Senior Design Day","description":"We engineered a custom dataset of over 1,100 legal regulations, developed a vector search system using FAISS, and implemented Google Gemini\u2019s language model to optimize speed and accuracy. This tool is a scalable solution designed to improve legal compliance support and could be adapted to other industries that require precise regulatory information.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/","og_locale":"en_US","og_type":"article","og_title":"Paychex - Senior Design Day","og_description":"We engineered a custom dataset of over 1,100 legal regulations, developed a vector search system using FAISS, and implemented Google Gemini\u2019s language model to optimize speed and accuracy. This tool is a scalable solution designed to improve legal compliance support and could be adapted to other industries that require precise regulatory information.","og_url":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/","og_site_name":"Senior Design Day","article_published_time":"2025-05-05T14:22:41+00:00","article_modified_time":"2026-05-04T14:43:47+00:00","og_image":[{"width":1200,"height":424,"url":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/logo-1200x424.png","type":"image\/png"}],"author":"Ethan Leung, Carol Li, Astha Singh, Keming Zhang","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Ethan Leung, Carol Li, Astha Singh, Keming Zhang","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#article","isPartOf":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/"},"author":{"name":"Ethan Leung","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/#\/schema\/person\/f806009484b11725d3e68e9cc939c980"},"headline":"Paychex","datePublished":"2025-05-05T14:22:41+00:00","dateModified":"2026-05-04T14:43:47+00:00","mainEntityOfPage":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/"},"wordCount":1186,"image":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#primaryimage"},"thumbnailUrl":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/logo.png","keywords":["Data Science","Embedded AI"],"articleSection":["3. Programs Archive","Artificial Intelligence","CSC Archive"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/","url":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/","name":"Paychex - Senior Design Day","isPartOf":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#primaryimage"},"image":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#primaryimage"},"thumbnailUrl":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/logo.png","datePublished":"2025-05-05T14:22:41+00:00","dateModified":"2026-05-04T14:43:47+00:00","author":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/#\/schema\/person\/f806009484b11725d3e68e9cc939c980"},"description":"We engineered a custom dataset of over 1,100 legal regulations, developed a vector search system using FAISS, and implemented Google Gemini\u2019s language model to optimize speed and accuracy. This tool is a scalable solution designed to improve legal compliance support and could be adapted to other industries that require precise regulatory information.","breadcrumb":{"@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#primaryimage","url":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/logo.png","contentUrl":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-content\/uploads\/2025\/05\/logo.png","width":2560,"height":424},{"@type":"BreadcrumbList","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/paychex-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/"},{"@type":"ListItem","position":2,"name":"Paychex"}]},{"@type":"WebSite","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/#website","url":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/","name":"Senior Design Day","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/#\/schema\/person\/f806009484b11725d3e68e9cc939c980","name":"Ethan Leung","url":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/author\/eleung6\/"}]}},"_links":{"self":[{"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/posts\/186582","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/users\/15722"}],"replies":[{"embeddable":true,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/comments?post=186582"}],"version-history":[{"count":18,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/posts\/186582\/revisions"}],"predecessor-version":[{"id":205062,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/posts\/186582\/revisions\/205062"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/media\/204672"}],"wp:attachment":[{"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/media?parent=186582"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/categories?post=186582"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/tags?post=186582"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.hajim.rochester.edu\/senior-design-day\/wp-json\/wp\/v2\/coauthors?post=186582"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}