Incorrect language detection leads to annoying recommendation #143901

sbatten · 2022-02-24T17:56:22Z

Typing out manually

class Foo {
	private foo: Foo
	constructor() {
		this.foo = ''
	}
}

TylerLeonhardt · 2022-02-24T21:00:07Z

Related to this, I also get C++ guessed if I open a new untitled file and type:

yes just one character. I was attempting to type <html which does guess correctly, but I feel like those first few characters that guess C++ is a little much.

For my issue, maybe we wait for like 5 characters at least?

TylerLeonhardt · 2022-02-24T23:57:21Z

I put my comment in a new issue since I think they're not going to require the same fix.

JacksonKearl · 2022-02-28T17:20:17Z

Added a minimum amount of text used when guessing which might help here. Getting a confidence score out of the model is a better fix.

I wonder if we should revisit how we prompt for ext recommendations in untitled files. A "This Looks Like C++, Would you like to install relevant extensions? Yes / No / Not C++ " would be good for analysis of model performance as well.

sbatten · 2022-02-28T18:37:35Z

I wonder if we should be prompting at all in this case. If the user isn't even saving the file yet, do we really think they want to install extensions?

JacksonKearl · 2022-02-28T18:40:26Z

Yeah I thought about that as well, I'd be fine not showing ext installs for Untitled files. @digitarald what do you think?

digitarald · 2022-03-01T01:48:30Z

Agreed, not showing it for untitled buffers makes sense – until they saved it.

TylerLeonhardt · 2022-03-01T04:25:31Z

I don't know... I feel conflicted. As a new user to VS Code, I might:

Click "New File" which gives me an untitled file
Paste in my C code from my homework assignment
wonder why all my friends say to use VS Code because all it did was made the code look pretty

The language detection was to help new users be on that happy path and if we take away that notification, I'm afraid they won't get guided enough down that path.

Now maybe turning the notification into something less aggressive... or perhaps delaying the notification for some amount of time... that might be a happy medium.

digitarald · 2022-03-01T05:16:36Z

Usertesting showed a lot of confusion with untitled buffers and not getting rich language features, so I agree with @TylerLeonhardt's argument.

As far as I understand: The model does bias against recently used file types ↪ Detected file types are likely to have been opened before ↪ So the get=the-extension=for-this-type recommendation might not be that common. Is that right?

sbatten · 2022-03-01T17:13:29Z

@digitarald @TylerLeonhardt I'm skeptical, but only because I don't think I have all the information. Not getting rich language features in an untitled buffer where you've never saved a single cpp file seems like a reasonable scenario. Not getting rich language features when you have used c++ in the past and installed the extension but we didn't detect the language is more confusing.

If this is a new user doing their homework, they are going to need to save this file to run it, at least without some extension installed which implies they already know a bit about how things work in VS Code. I feel like save is the right signal for showing extension recommendations.

TylerLeonhardt · 2022-03-01T18:52:22Z

If this is a new user doing their homework, they are going to need to save this file to run it

About ~1 yr ago, @isidorn added a "save on run" feature. The intent (my understanding) is that this was added because a lot of new users were "running" untitled files without saving them to an actual concrete file and extensions were having to handle this case (and some weren't causing confusion to the user). This was mostly the case because new users weren't quite grasping that an untitled file wasn't a real file on disk.

Maybe @isidorn has some thoughts on this

JacksonKearl · 2022-03-01T19:00:02Z

I think save on run is equally valid for files on disk -- if you don't have autosave enabled it's easy to make a change and run without saving in between then get stuck in a undesirable "why aren't my changes doing anything" state.

JacksonKearl · 2022-03-01T19:01:47Z

That being said, I'm looking into gauging model confidence this iteration, so I could see a situation where we only show popups when the model is very confident (or the language was explicitly selected).

sbatten · 2022-03-01T21:57:50Z

The save on run feature already handles this...

TylerLeonhardt · 2022-03-02T00:19:42Z

I don't know if we can assume they get that far.

isidorn · 2022-03-02T10:24:14Z

@TylerLeonhardt correct, we added a sev on run for what you pointed out.

I like the flow of untitled file -> language is auto detected -> recommendation is shown. I think it is smooth for new users.
Can we just try to fine tune the model confidence this iteration as @JacksonKearl says? So that we detect more strictly. That could fix this scenario.
I do not like the idea of splitting the behaviour based on model confidence (could be confusing). IMHO model confidence should only influence if we auto detect or not.

JacksonKearl · 2022-03-17T00:16:39Z

The model now has some better confidence heuristics built in.

sbatten assigned JacksonKearl Feb 24, 2022

TylerLeonhardt mentioned this issue Feb 24, 2022

Related to this, I also get C++ guessed if I open a new untitled file and type: #143930

Closed

JacksonKearl added bug Issue identified by VS Code Team member as probable bug languages-guessing Language guessing issues labels Feb 28, 2022

JacksonKearl added this to the March 2022 milestone Feb 28, 2022

kieferrm mentioned this issue Mar 14, 2022

Iteration Plan for March 2022 #144519

Closed

87 tasks

JacksonKearl closed this as completed Mar 17, 2022

rzhao271 added the verified Verification succeeded label Mar 24, 2022

github-actions bot locked and limited conversation to collaborators May 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect language detection leads to annoying recommendation #143901

Incorrect language detection leads to annoying recommendation #143901

sbatten commented Feb 24, 2022

TylerLeonhardt commented Feb 24, 2022

TylerLeonhardt commented Feb 24, 2022

JacksonKearl commented Feb 28, 2022

sbatten commented Feb 28, 2022

JacksonKearl commented Feb 28, 2022

digitarald commented Mar 1, 2022

TylerLeonhardt commented Mar 1, 2022

digitarald commented Mar 1, 2022

sbatten commented Mar 1, 2022

TylerLeonhardt commented Mar 1, 2022

JacksonKearl commented Mar 1, 2022

JacksonKearl commented Mar 1, 2022

sbatten commented Mar 1, 2022

TylerLeonhardt commented Mar 2, 2022

isidorn commented Mar 2, 2022 •

edited

Loading

JacksonKearl commented Mar 17, 2022

Incorrect language detection leads to annoying recommendation #143901

Incorrect language detection leads to annoying recommendation #143901

Comments

sbatten commented Feb 24, 2022

TylerLeonhardt commented Feb 24, 2022

TylerLeonhardt commented Feb 24, 2022

JacksonKearl commented Feb 28, 2022

sbatten commented Feb 28, 2022

JacksonKearl commented Feb 28, 2022

digitarald commented Mar 1, 2022

TylerLeonhardt commented Mar 1, 2022

digitarald commented Mar 1, 2022

sbatten commented Mar 1, 2022

TylerLeonhardt commented Mar 1, 2022

JacksonKearl commented Mar 1, 2022

JacksonKearl commented Mar 1, 2022

sbatten commented Mar 1, 2022

TylerLeonhardt commented Mar 2, 2022

isidorn commented Mar 2, 2022 • edited Loading

JacksonKearl commented Mar 17, 2022

isidorn commented Mar 2, 2022 •

edited

Loading