well I can't think of any right now, maybe with a command line. I don't use them so somebody else will have to answer for this.
or you could extract both sound taks and then add them the other way round japanese first and english second. You can do that with virtualdub ( I think you need a plugin for ogm files).
the thing is it'll be longer and harder than just writting a bsi file