If you are interested in using #codellama or #wizardcoder with #vscode, I have a foss #vscode / #vscodium extension which allows for fairly flexible code editing using these models.
https://github.com/balisujohn/localpilot
Since it uses #text-generation-webui as a backend, it's compatible with machines with and without GPUs and doesn't require a Docker server like Fauxpilot.
This demo is local CPU inference only with a laptop i7, with the model WizardCoder python 13b.