Commits · visualisable-ai/api

fix: Use eager attention for output_attentions support

5333b21

gary-boon and Claude Opus 4.5 committed 24 days ago

fix: Skip heavy ML deps in CI security checks

ba27c0c

gary-boon and Claude Opus 4.5 committed 24 days ago

fix: Update torch to 2.3+ for transformers compatibility

1b73605

gary-boon and Claude Opus 4.5 committed 24 days ago

fix: Update transformers for Devstral support

b788304

gary-boon and Claude Opus 4.5 committed 24 days ago

docs: Mark GPU HF Space Devstral deployment complete

65c6e2e

gary-boon and Claude Opus 4.5 committed 24 days ago

docs: Update phased plan with Phase 2/2b/2c completion status

688efad

gary-boon and Claude Opus 4.5 committed 24 days ago

Add vocabSize to modelInfo response

499afba

gary-boon and Claude Opus 4.5 committed 24 days ago

Update .env.spark.example: TORCH_DTYPE now auto-detected

543454f

gary-boon and Claude Opus 4.5 committed 24 days ago

Add recommended_dtype to model configs

62525b2

gary-boon and Claude Opus 4.5 committed 24 days ago

Phase 2: Add Devstral backend support

9080f28

gary-boon and Claude Opus 4.5 committed 24 days ago

Update plan: Phase 1 paused due to GB10 GPU support

e694533

gary-boon and Claude Opus 4.5 committed 24 days ago

Add DEVICE env var to force CPU mode on DGX Spark

5f122aa

gary-boon and Claude Opus 4.5 committed 24 days ago

Use NGC PyTorch 24.08 for Python 3.10 compatibility

a2875a2

gary-boon and Claude Opus 4.5 committed 24 days ago

Use NVIDIA NGC PyTorch container for GB10 support

a4cfbff

gary-boon and Claude Opus 4.5 committed 24 days ago

Try PyTorch nightly for GB10/sm_121 GPU support

a009a49

gary-boon and Claude Opus 4.5 committed 24 days ago

Make zarr/numcodecs imports optional for ARM64 compatibility

6435a75

gary-boon and Claude Opus 4.5 committed 24 days ago

Skip zarr/numcodecs in Spark build (ARM64 incompatible)

d129e37

gary-boon and Claude Opus 4.5 committed 24 days ago

Fix numcodecs ARM64 compatibility in Dockerfile.spark

772fc80

gary-boon and Claude Opus 4.5 committed 24 days ago

Fix Dockerfile.spark for CUDA 13.0 compatibility

a4927aa

gary-boon and Claude Opus 4.5 committed 24 days ago

Fix Dockerfile.spark for ARM64 architecture (DGX Spark)

9d00d33

gary-boon and Claude Opus 4.5 committed 24 days ago

Add GPU-enabled Dockerfile for Spark

9377cd8

gary-boon and Claude Opus 4.5 committed 24 days ago

Fix Dockerfile: add build-essential for numcodecs compilation

3b5c3ac

gary-boon and Claude Opus 4.5 committed 24 days ago

Phase 1: DGX Spark infrastructure

a2bd186

gary-boon and Claude Opus 4.5 committed 24 days ago

Add Devstral + DGX Spark implementation plan

ab4534a

gary-boon and Claude Opus 4.5 committed 25 days ago

Make QKV hook robust against shape mismatches

343dd57

gary-boon and Claude committed on Nov 18, 2025

Fix research attention endpoint model compatibility

f5ba954

gary-boon and Claude committed on Nov 18, 2025

Fix zarr/numcodecs version compatibility

9e9dc34

gary-boon and Claude committed on Nov 18, 2025

Add zarr to requirements.txt for storage module

f54e3f9

gary-boon and Claude committed on Nov 18, 2025

Add research attention analysis endpoint with real CodeGen tokenization

8f63685

gary-boon and Claude committed on Nov 18, 2025

Add research attention analysis endpoints with Q/K/V extraction

37ed739

gary-boon and Claude committed on Nov 13, 2025

Fix ablation study for Code Llama compatibility

cd300ee

gary-boon and Claude committed on Oct 31, 2025

Fix model info endpoint for Code Llama compatibility

7dd568f

gary-boon and Claude committed on Oct 31, 2025

Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes

ed40a9a

gary-boon and Claude committed on Oct 30, 2025

Add Claude Code configuration

03971da

gary-boon and Claude committed on Oct 25, 2025

Fix pyarrow compatibility issue with datasets library

1680fda

gary-boon and Claude committed on Sep 16, 2025

Fix syntax error in swe_bench_service.py

9dbec03

gary-boon and Claude committed on Sep 16, 2025

Remove all mock data from SWE-bench - real data only

c0d95bf

gary-boon and Claude committed on Sep 16, 2025

Add GitHub URLs and improve mock data for SWE-bench

22c69fa

gary-boon and Claude committed on Sep 16, 2025

Fix SWE-bench service to gracefully handle dataset loading failures

ae9e159

gary-boon and Claude committed on Sep 16, 2025

Fix SWE-bench service to return full problem statements

1d23728

gary-boon and Claude committed on Sep 16, 2025

Add SWE-bench integration and improve backend routing

4444ae2

gary-boon and Claude committed on Sep 15, 2025

Consolidate HuggingFace deployment into security workflow

07be0bf

gary-boon and Claude committed on Sep 12, 2025

Add GitHub Action to deploy to both CPU and GPU HuggingFace Spaces

8b77dd5

gary-boon and Claude committed on Sep 12, 2025

Add layer_stride parameter for PromptDiff optimization

5aed1a9

gary-boon and Claude committed on Sep 12, 2025

Capture complete attention patterns after generation

992dc8c

gary-boon committed on Sep 11, 2025

Update HuggingFace Space description

c2f6135

gary-boon and Claude committed on Sep 9, 2025

Fix: Use scaling approach instead of skipping layers

3c774b5

gary-boon and Claude committed on Sep 2, 2025

Fix: Refine layer hook output format handling

4b03268

gary-boon and Claude committed on Sep 2, 2025

Fix: Handle single-element tuple outputs in layer hook

9e42df9

gary-boon and Claude committed on Sep 2, 2025

Fix: Correct layer hook output format for layer_norm compatibility

070f9b8

gary-boon and Claude committed on Sep 1, 2025
