Help Wanted

Discover open source projects where your contributions truly matter

While large projects overflow with contributions, smaller initiatives eagerly await your expertise and passion

Active filters:Stars: 10+ Stars
Filters trigger API requests for better performance

Showing 30 opportunities to contribute

AI Summary: This feature request proposes adding a new dataset-level check function to validate data ingestion processes. The primary goal is to ensure that the number of records ingested accurately matches the upstream source, addressing a critical need in ingestion scenarios. This validation mechanism would support verifying metrics like row counts or executing custom SQL logic against upstream tables.

Complexity:4/5
enhancementgood first issue

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

Python
#data-profiling#data-quality#data-quality-monitoring#databricks#lakeflow#spark#spark-streaming#unity-catalog

AI Summary: The issue describes an intermittent bug where Eagle v2, when using the Triton attention backend, produces NaN values during the replay of the draft extend CUDA graph. This leads to a `torch.gather` out-of-bounds crash. The problem can be reproduced by running `test_eagle_infer_beta.py` with Triton attention and 400 concurrent requests, though it may require multiple attempts.

Complexity:4/5
buggood first issue

SGLang is a high-performance serving framework for large language models and multimodal models.

Python
#attention#blackwell#cuda#deepseek#diffusion#glm#gpt-oss#inference#llama#llm#minimax#moe#qwen#qwen-image#reinforcement-learning#transformer#vlm#wan

AI Summary: This issue suggests containerizing the project by adding a Dockerfile and related setup. The primary goal is to simplify the project's setup and deployment process, enabling contributors and users to run the application easily without encountering local dependency issues.

Complexity:3/5
good first issue

This project serves as an easy-to-understand setup for beginners , providing a base foundation in Spring Boot , MVC & hibernate.

Java
#e-commerce-project-in-java#eclipse#java#java-ecommerce-system#java-project#java-web-app#javaproject#jdbc#mysql#spring#spring-boot#spring-boot-project#spring-mvc#spring-projects#spring-web-project#springboot

AI Summary: This GitHub issue proposes a refactoring effort to improve type safety and code clarity within the Dify codebase. The goal is to replace generic `dict` and `Mapping` type hints with `TypedDict` definitions wherever possible, enhancing static analysis and developer experience. The issue specifically requests a list of files needing changes and an example of the proposed modification.

Complexity:3/5
good first issue🙏 help wanted

Production-ready platform for agentic workflow development.

TypeScript
#agent#agentic-ai#agentic-framework#agentic-workflow#ai#automation#gemini#genai#gpt#gpt-4#llm#low-code#mcp#nextjs#no-code#openai#orchestration#python#rag#workflow

AI Summary: This GitHub issue requests the conversion of existing unit tests for the epub viewer's search functionality from `@vue/test-utils` to Vue Testing Library. The tests, located in `SearchButton.spec.js` and `SearchSideBar.spec.js`, currently couple tests to component internals. The migration aims to promote testing user-facing behavior over implementation details.

Complexity:1/5
TAG: tech update / debtP3 - lowPLUGIN: epubDEV: frontendhelp wantedgood first issuecommunity-contribution-in-progress

Kolibri Learning Platform: the offline app for universal education

CSS
#edtech#education#hacktoberfest#oer#offline-first#sdg-4

AI Summary: This issue identifies a significant duplication in test cluster setup across Rust, C++, and Python, including constants like image names and orchestration logic for services like Zookeeper. It proposes unifying this infrastructure by having the Python test suite reuse the existing Rust cluster manager via PyO3, leveraging its async capabilities to reduce redundancy.

Complexity:3/5
good first issue

Rust Client for Apache Fluss (Incubating)

Rust
#fluss#lakehouse#python#real-time-analytics#rust#streaming

AI Summary: The current Profiler class allows users to filter columns when generating automated data quality check rules, but lacks the ability to select or deselect specific check functions (e.g., `is_not_null`, `min_max`) to be used. The proposal is to add an attribute to custom profiling options or an argument to the `generate_dq_rules` method, enabling users to filter these check functions.

Complexity:2/5
enhancementgood first issue

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

Python
#data-profiling#data-quality#data-quality-monitoring#databricks#lakeflow#spark#spark-streaming#unity-catalog

AI Summary: This GitHub issue is a "Good First Issue" designed for absolute beginners. The task involves adding a new regional Japanese dialect entry (number 63) to a JSON file. It requires no coding, can be completed entirely in the browser, and is estimated to take less than one minute.

Complexity:1/5
help wantedgood first issuehacktoberfestjavascriptcommunityeasylow hanging fruitbeginnerup-for-grabsfirst-timers-only

Aesthetic, minimalist platform for learning Japanese inspired by Duolingo and Monkeytype, built with Next.js and sponsored by Vercel. Beginner-friendly with plenty of good first issues - all contributions are welcome!

TypeScript
#beginner#beginner-friendly#contribute#contribution#contributions-welcome#first-contributions#first-timers-only#good-first-contribution#good-first-issue#good-first-pr#hacktoberfest#help-wanted#japanese#japanese-language#language-learning#learn-japanese#nextjs#open-source#react#up-for-grabs

AI Summary: The pattern detection engine, which includes 10 risk detectors, is currently disabled despite its existence. This issue aims to enable the engine by thoroughly testing each detector against real Neo4j data, fixing any false positives, documenting expected outputs, and adding integration tests for the pattern endpoints. The ultimate goal is to enable the `PATTERNS_ENABLED` flag in production.

Complexity:4/5
help wanted

AI Summary: This issue addresses the complete lack of API test coverage, which currently prevents guaranteeing system functionality after code changes. The task involves implementing a comprehensive integration test suite for various API endpoints, including search, chat, and security features. Tests will utilize pytest, httpx, and pytest-asyncio, run against real Neo4j and Redis databases, and be integrated into the CI pipeline.

Complexity:4/5
help wanted

AI Summary: This P0 issue highlights a critical lack of entity resolution, where the same entities appear across multiple datasets with inconsistent name formats, leading to no deduplication or linking. The proposed solution involves implementing a robust process that includes name normalization, exact matching using document IDs (CPF/CNPJ), fuzzy matching for entries without IDs, and merging identified duplicates into canonical entities with linked sources.

Complexity:4/5
help wanted

AI Summary: This GitHub issue aims to significantly improve the onboarding experience for new contributors by addressing common pain points in local setup. It proposes creating a comprehensive visual walkthrough (video/GIF), clarifying API key acquisition, providing IDE-specific guides, simplifying Docker usage, and adding a troubleshooting section and architecture diagram to the documentation. The goal is to make local environment setup much clearer and less intimidating for newcomers.

Complexity:4/5
documentationhelp wantedgood first issue

AI Summary: The current graph database has 9.1M company nodes but extremely low connectivity (0.004%), rendering it ineffective for showing relationships. This issue aims to build 20-40M `SOCIO_DE` relationships by processing QSA data, connecting Person nodes to Company nodes. This is a critical step for EGOS Inteligência to deliver on its core value proposition of demonstrating connections within the data.

Complexity:4/5
help wanted

AI Summary: This GitHub issue requests adding comprehensive unit tests for the `execution_cleanup` background service, which removes stale executions. The tests should cover stale execution detection, batch processing, graceful shutdown via context cancellation, error handling during storage operations, and correct metrics tracking, utilizing a mocked execution store.

Complexity:3/5
good first issuearea:control-planetests

Framework for AI Backend. Build and run AI agents like microservices - scalable, observable, and identity-aware from day one.

Go
#agent#agent-auth#agent-authentication#agent-indentity#agent-scaling#agentic-ai#ai#ai-backend#aiagent#anthropic#cloud-native#genai#go#kubernetes#llm#multiagent#multiagent-systems#python#rag#typescript

AI Summary: This is a beginner-friendly GitHub issue, labeled as a 'Good First Issue' and suitable for Hacktoberfest. The task is to add a specific Japanese video game quote to a JSON file, which can be done directly from the browser in under a minute without any coding or local setup.

Complexity:1/5
documentationhelp wantedgood first issuehacktoberfestcommunitylow hanging fruitbeginnerup-for-grabsfirst-timers-onlytypescript

Aesthetic, minimalist platform for learning Japanese inspired by Duolingo and Monkeytype, built with Next.js and sponsored by Vercel. Beginner-friendly with plenty of good first issues - all contributions are welcome!

TypeScript
#beginner#beginner-friendly#contribute#contribution#contributions-welcome#first-contributions#first-timers-only#good-first-contribution#good-first-issue#good-first-pr#hacktoberfest#help-wanted#japanese#japanese-language#language-learning#learn-japanese#nextjs#open-source#react#up-for-grabs

AI Summary: The user requests the automated build of two distinct firmware types: one specifically for ESP32 microcontrollers and another that integrates both FIDO2 and OpenPGP functionalities, likely from an existing `fido2` development branch. This aims to broaden hardware compatibility and provide a combined security feature set.

Complexity:4/5
enhancementgood first issuehelp wanted

A custom build pico-fido firmware(s) with LibreKeys applied patches

Nix
#firmware#nix

AI Summary: After exiting the terminal application, the text labels beneath the app icons on the home screen fail to render, appearing blank. This visual rendering issue is specific to closing the terminal app, as exiting other applications does not cause the same problem.

Complexity:3/5
buggood first issue

An E-Ink PDA device using the ESP32 S3.

C++

AI Summary: The `kysely` library's `MysqlDialectConfig` is currently incompatible with `mysql2@v3.18.2` due to a breaking type change in `mysql2`. The `query` method in `mysql2` now expects a more specific `QueryValues` type for its `parameters`, whereas `kysely` still uses `ReadonlyArray<unknown>`, leading to a type mismatch. A solution is needed to update `kysely`'s type definitions to align with the new `mysql2` types, with consideration for maintainability.

Complexity:3/5
good first issuemysqlbuilt-in dialecttypescriptgreenlit

A type-safe TypeScript SQL query builder

TypeScript
#browser#bun#database#deno#microsoft-sql-server#mssql#mysql#nodejs#postgresql#query-builder#sql#sqlite#type-safe#typescript

AI Summary: The GitHub release workflow is encountering issues due to an outdated goreleaser configuration (version 0, deprecated `archives.format`) and Go build errors that are specific to the CI environment, not reproducible locally or via Docker. The task involves updating the goreleaser configuration to version 2 and debugging/resolving the CI-specific build failures to enable successful releases.

Complexity:3/5
good first issueci

A fast and easy-to-use CDN, built with Go.

TypeScript
#cdn#gin#gin-gonic#gin-gorm#go#go-app#go-cdn#golang#gorm#gorm-orm

AI Summary: The user proposes an enhancement to make Prometheus installable via `pip install prometheus`. The primary motivation is to simplify the installation process, as the current method of building from source is considered tedious, and container-based installations are reportedly problematic on Mac M-series machines.

Complexity:4/5
enhancementgood first issue

AI Summary: The project has begun implementing `create()` methods to bootstrap various project types from templates, with current support for five specific types. The objective is to extend this functionality to encompass all existing project types, ensuring each new `create()` method is accompanied by a corresponding roundtrip test.

Complexity:4/5
enhancementgood first issue

A project about projects

Python
#projects#python

AI Summary: A high-severity Denial of Service (DoS) vulnerability (CVE-2024-21521, CVSS 7.5) has been identified in `@discordjs/opus` versions <= 0.9.0. This flaw allows an attacker to crash the process by providing a specially crafted input object, affecting the project's TTS service via `prism-media`'s `OggDemuxer`. The recommended solution is to update `@discordjs/opus` to version 0.10.0 or newer.

Complexity:2/5
good first issuesecuritycode-quality

小智AI客户端,目前主要用于MCP的对接

TypeScript

AI Summary: This GitHub issue reports an error when users click the 'search' button for 'Citation count' in the entry editor's 'General' tab, even though citation data might exist in the 'Citations' tab. The proposed solution is to enhance the `CitationCountEditor` by adding a dropdown menu, allowing users to explicitly select which citation fetcher service to use, thereby addressing the current error and providing more control.

Complexity:3/5
component: entry-editorgood first issue📍 Assigned📌 Pinned🔔 reminder-sent

Graphical Java application for managing BibTeX and BibLaTeX (.bib) databases

Java
#academia#academic-publications#ai#biblatex#bibliography#bibtex#citation#citation-generator#citation-style-language#citation-styles#hacktoberfest#jabref#java#javafx#javafx-application#javafx-desktop-apps#javafx-gui#latex#reference-manager#research-tool

AI Summary: This feature request proposes a "Runtime App Framework" for the device UI, enabling developers to add functionality without modifying and recompiling the core firmware. The solution envisions a sandboxed environment where small applications, written in a scripting language, can be loaded from an SD card and browsed from a dedicated screen. Key considerations include a restrictive scripting VM for security, user confirmation or signature verification for untrusted code, and the ability to compile some default apps into the firmware.

Complexity:5/5
enhancementgood first issue

meshtastic device-ui library

C

AI Summary: The user appreciates the ability to skip checks but notes that currently, skipped checks incorrectly mark all rows as invalid and add them to the invalid dataframe. They propose that skipped checks should not invalidate rows and request improved, structured logging for skipped checks, moving away from current text-based parsing.

Complexity:3/5
enhancementgood first issueQ1 2026

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

Python
#data-profiling#data-quality#data-quality-monitoring#databricks#lakeflow#spark#spark-streaming#unity-catalog

AI Summary: The BATCH plugin in OpenVINO currently does not expose the underlying hardware (HW) plugin's compiled model properties, such as those from a GPU plugin, in its `supported_properties` list. This issue requests to modify the BATCH plugin to retrieve and include these HW plugin properties, like `OPTIMAL_NUMBER_OF_INFER_REQUESTS`, in its own `supported_properties` when a user queries them.

Complexity:2/5
good first issueno_stale

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++
#ai#computer-vision#deep-learning#deploy-ai#diffusion-models#generative-ai#good-first-issue#inference#llm-inference#natural-language-processing#nlp#openvino#optimize-ai#performance-boost#recommendation-system#speech-recognition#stable-diffusion#transformers#yolo

AI Summary: This GitHub issue proposes integrating the newly developed `has_no_outliers` check function into the automated rule generation process. Currently, this function is only available for manual input, and the goal is to enable the profiler/generator to automatically suggest or apply this check.

Complexity:3/5
enhancementgood first issue

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

Python
#data-profiling#data-quality#data-quality-monitoring#databricks#lakeflow#spark#spark-streaming#unity-catalog

AI Summary: This issue requests the creation of a C API for the existing `Text2VideoPipeline` C++ class, mirroring the structure and conventions of the `LLMPipeline`'s C API. The task involves adding new C header and source files, exposing core functionalities like pipeline creation, destruction, generation, and configuration, along with developing a C sample and corresponding tests.

Complexity:3/5
good first issuecategory: C API

Run Generative AI models with simple C++/Python API and using OpenVINO Runtime

C++

AI Summary: This issue proposes enhancing the `rpcclient` by adding convenience wrapper functions for commonly fetched ledger entries. Specifically, it requests `getAccountEntry`, `getTrustline`, and `getClaimableBalance` methods to simplify their retrieval. Existing pull requests in `js-stellar-sdk` are provided as references for implementation guidance.

Complexity:2/5
rpc-sdk-scrumgood first issue

Stellar's Go SDK

Go
#blockchain#cryptocurrency#horizon#stellar