Nectar considers the energy consumption of mixture-of-experts models (fundamental to scaling modern LLMs, diffusion models, etc.) and reroutes experts during inference based on their energy profiles (a new paradigm in test-time adaptation techniques).
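A minimal sketch of what that rerouting could look like (everything here is hypothetical; `energy_cost` would come from offline profiling or live power readings, e.g. per-expert joules/token, not from the router itself):

```python
# Hedged sketch: bias router logits against energy-hungry experts before top-k.
import torch

def energy_aware_route(router_logits: torch.Tensor,  # (batch, num_experts)
                       energy_cost: torch.Tensor,    # (num_experts,), normalized to [0, 1]
                       lam: float = 0.1,             # routing-quality vs. energy trade-off
                       top_k: int = 2):
    adjusted = router_logits - lam * energy_cost     # broadcasts over the batch
    weights, experts = torch.topk(torch.softmax(adjusted, dim=-1), k=top_k, dim=-1)
    weights = weights / weights.sum(dim=-1, keepdim=True)  # renormalize gate weights
    return weights, experts
```

Refreshing `energy_cost` online from measured device power is what would make this test-time adaptation rather than a static penalty.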
to consider: kernel optimizations for dequantization & TTT specifically (fused kernels, large-chunk updates, tile packing). Look into the TTT architecture and the MIT paper https://arxiv.org/pdf/2505.23884 ("Test-Time Training Done Right"), in particular its convexity proofs/logic and large-chunk updates.
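For the large-chunk part, a rough sketch of the core idea as I understand it (one fast-weight update per large chunk of tokens instead of per token, so each update is a few big matmuls; the linear fast weights, reconstruction loss, and shapes are my assumptions, not the paper's exact formulation):

```python
# Hedged sketch of a large-chunk test-time update (spirit of arXiv:2505.23884).
# Assumptions: linear fast weights and an L2 reconstruction objective.
import torch

def chunked_ttt_update(fast_w: torch.Tensor,    # (d, d) fast weights
                       keys: torch.Tensor,      # (seq, d) token features
                       values: torch.Tensor,    # (seq, d) targets
                       lr: float = 1e-2,
                       chunk: int = 2048) -> torch.Tensor:
    for start in range(0, keys.size(0), chunk):
        k = keys[start:start + chunk]
        v = values[start:start + chunk]
        pred = k @ fast_w                        # one large matmul per chunk
        grad = k.t() @ (pred - v) / k.size(0)    # grad of 0.5*mean||kW - v||^2
        fast_w = fast_w - lr * grad              # single update per chunk
    return fast_w
```

The hardware point is that per-chunk updates keep the GPU on large GEMMs, which is also where fused dequantization kernels would slot in.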
Nectar is essentially analogous to the Switch Transformer but hardware-aware (for now). Another interpretation: take TTT research on model-weight updates and, now that a larger number of experts is being integrated into next-gen LLMs and GenAI, bring that test-time self-supervised learning to the rerouting itself.
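Under that second reading, a minimal sketch of TTT applied to the router rather than the experts (the self-supervised objective here, entropy minimization plus a load-balance penalty, is just one plausible choice, not anything Nectar has committed to):

```python
# Hedged sketch: a few self-supervised gradient steps on the router at test time.
import torch

def adapt_router(router: torch.nn.Linear,    # d_model -> num_experts gate
                 hidden: torch.Tensor,       # (tokens, d_model) activations seen at inference
                 steps: int = 3,
                 lr: float = 1e-3) -> torch.nn.Linear:
    opt = torch.optim.SGD(router.parameters(), lr=lr)
    for _ in range(steps):
        probs = torch.softmax(router(hidden), dim=-1)                    # (tokens, num_experts)
        entropy = -(probs * probs.clamp_min(1e-9).log()).sum(-1).mean()  # sharpen decisions
        load = probs.mean(0)                                             # mean usage per expert
        balance = (load * load.numel()).pow(2).mean()                    # penalize hot experts
        loss = entropy + balance
        opt.zero_grad()
        loss.backward()
        opt.step()
    return router
```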
TODO: consider model sharding within the GPU for memory management (good experimentation area)
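One possible experiment shape (hypothetical: an LRU set of GPU-resident experts, with master copies kept in host memory and paged in on demand when the router selects them):

```python
# Hedged sketch of on-demand expert paging; one reading of the sharding TODO.
import copy
from collections import OrderedDict

class ExpertCache:
    """Keep at most `max_resident` expert copies on the GPU, LRU-evicted."""

    def __init__(self, cpu_experts, device="cuda", max_resident=4):
        self.cpu_experts = cpu_experts     # list of nn.Module, master copies on CPU
        self.device = device
        self.max_resident = max_resident
        self.resident = OrderedDict()      # expert id -> GPU copy, in LRU order

    def fetch(self, idx):
        if idx in self.resident:
            self.resident.move_to_end(idx)     # recently used -> back of the queue
            return self.resident[idx]
        if len(self.resident) >= self.max_resident:
            self.resident.popitem(last=False)  # drop the coldest GPU copy
        gpu_copy = copy.deepcopy(self.cpu_experts[idx]).to(self.device)
        self.resident[idx] = gpu_copy
        return gpu_copy
```

Measuring page-in stalls against the memory saved (and overlapping copies with compute via CUDA streams) is probably where the experiment gets interesting.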
THIS IS IMPORTANT: "hardware-aware memory mapping" from Hyper Accel Adelia (https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11075108) --> understand fully & see how to implement it for the decode phase of Nectar!