如何解析未引用的JSON字符串

时间:2022-03-08 17:03:15

I have a JSON string where neither the keys nor the values are quoted and I would like to transform it to a correctly formatted JSON.

我有一个JSON字符串,其中既没有引用键也没有引用值,我想将它转换为一个正确格式化的JSON。

{basic:{0:{index:0, license:t, nameSID:n, image:"img_left", descriptionSID:t, category:r}}

Is there a Java library that could handle it? I tried Jackson but it does not work.

有Java库可以处理它吗?我试过杰克逊,但没用。

regards, Jan

问候,简

3 个解决方案

#1


5  

You can use JsonParser.Feature.ALLOW_UNQUOTED_FIELD_NAMES with Jackson to allow unquoted field names:

您可以使用JsonParser.Feature。允许使用Jackson的ALLOW_UNQUOTED_FIELD_NAMES允许不引用字段名:

JsonFactory factory = new JsonFactory();
factory.configure(JsonParser.Feature.ALLOW_UNQUOTED_FIELD_NAMES, true);
JsonParser jp = factory.createJsonParser(new FileInputStream("content.json"));

#2


0  

Not sure if you got around to writing you own parser or not, but I did.

不确定您是否开始编写自己的解析器,但我确实这样做了。

https://github.com/ischumacher/rsjp

https://github.com/ischumacher/rsjp

Here is example usage with your example JSON:

下面是示例JSON的示例用法:

String str = "{basic:{0:{index:0, license:t, nameSID:n, image:\"img_left\", descriptionSID:t, category:r}}";
Map<String, Object> jso = Json.parseJSON(str);
System.out.println(jso);
System.out.println(Json.get(jso, "basic", "0", "image"));

Here is the output:

这是输出:

{
   basic: 
   {
      0: 
      {
         index: 0, 
         license: t, 
         nameSID: n, 
         image: img_left, 
         descriptionSID: t, 
         category: r
      }
   }
}

img_left

#3


0  

I looked into how one might go about customizing Jackson to deal with non-quoted field values (vs. field names). Even though I ended-up writing a hack instead, I'm posting my breadcrumb trail here for others. My code spelunking was done in Jackson 2.7.2.

我研究了如何定制Jackson以处理不引用的字段值(与字段名)。尽管我最后还是写了一篇黑客文章,但我还是把我的面包屑路径贴在了这里。我的代码spelunking是在Jackson 2.7.2中完成的。

Jackson Core comes with two concrete implementations of the JsonParser interface:

Jackson Core提供了JsonParser接口的两个具体实现:

  • ReaderBasedJsonParser, a parser for character streams (encoding independent)
  • ReaderBasedJsonParser,字符流解析器(独立编码)
  • UTF8StreamJsonParser, a parser optimized for UTF-8 byte streams
  • UTF8StreamJsonParser,为UTF-8字节流优化的解析器

The code in these two classes is quite redundant, likely out of necessity. Each class has a method which is called by nextToken() when an unexpected character is encountered. ReaderBasedJsonParser's is named _handleOddValue() and UTF8StreamJsonParser's is _handleUnexpectedValue(). Stuff like accepting "NaN" as a numeric value and allowing single-quoted string values happens in here.

这两个类中的代码是相当多余的,可能是出于需要。每个类都有一个方法,当遇到意外字符时由nextToken()调用。ReaderBasedJsonParser命名为_handleOddValue(),而UTF8StreamJsonParser命名为_handleUnexpectedValue()。像接受“NaN”作为数值并允许单引号字符串值这样的事情在这里发生。

My plan (before I came to my senses and realized an awful hack would suffice for my short-term needs) was to subclass one/both of these parsers and override the methods above to handle unquoted string values. Because this method is called when the input stream is in the context of a field value (just after recognizing the colon), it should be possible to read forward until a comma or right curly brace is encountered and count everything read up to that point as a string value. This code is tricky to write as it requires an understanding of Jackson's buffering strategy, the parser's architecture (the current pointer into the current buffer is an instance variable), etc.

我的计划(在我意识到并意识到一个可怕的hack将满足我的短期需求之前)是对一个/两个解析器进行子类化,并重写上面的方法来处理未引用的字符串值。因为当输入流位于字段值的上下文中(仅在识别冒号之后)时,应该可以调用此方法,直到遇到逗号或右花括号并将该点的所有内容作为字符串值进行计数。这段代码很难编写,因为它需要理解Jackson的缓冲策略、解析器的体系结构(当前的指针指向当前缓冲区是一个实例变量)等等。

To make an ObjectMapper use this custom parser, one must subclass JsonFactory and override the _createParser() method with one which instantiates it. More work might be required to make both the regular and UTF-8 parser work correctly, although it's sufficient to force the use of the regular parser if performance isn't a concern. Then, an instance of this custom JsonFactory may be passed into ObjectMapper's constructor.

要使ObjectMapper使用这个自定义语法分析器,必须要子类化JsonFactory并使用实例化它的_createParser()方法。可能需要进行更多的工作,以使常规和UTF-8解析器工作正常,但如果不关心性能,就足以强制使用常规解析器。然后,该自定义JsonFactory的一个实例可能被传递到ObjectMapper的构造函数中。

Hope this helps someone.

希望这可以帮助别人。

#1


5  

You can use JsonParser.Feature.ALLOW_UNQUOTED_FIELD_NAMES with Jackson to allow unquoted field names:

您可以使用JsonParser.Feature。允许使用Jackson的ALLOW_UNQUOTED_FIELD_NAMES允许不引用字段名:

JsonFactory factory = new JsonFactory();
factory.configure(JsonParser.Feature.ALLOW_UNQUOTED_FIELD_NAMES, true);
JsonParser jp = factory.createJsonParser(new FileInputStream("content.json"));

#2


0  

Not sure if you got around to writing you own parser or not, but I did.

不确定您是否开始编写自己的解析器,但我确实这样做了。

https://github.com/ischumacher/rsjp

https://github.com/ischumacher/rsjp

Here is example usage with your example JSON:

下面是示例JSON的示例用法:

String str = "{basic:{0:{index:0, license:t, nameSID:n, image:\"img_left\", descriptionSID:t, category:r}}";
Map<String, Object> jso = Json.parseJSON(str);
System.out.println(jso);
System.out.println(Json.get(jso, "basic", "0", "image"));

Here is the output:

这是输出:

{
   basic: 
   {
      0: 
      {
         index: 0, 
         license: t, 
         nameSID: n, 
         image: img_left, 
         descriptionSID: t, 
         category: r
      }
   }
}

img_left

#3


0  

I looked into how one might go about customizing Jackson to deal with non-quoted field values (vs. field names). Even though I ended-up writing a hack instead, I'm posting my breadcrumb trail here for others. My code spelunking was done in Jackson 2.7.2.

我研究了如何定制Jackson以处理不引用的字段值(与字段名)。尽管我最后还是写了一篇黑客文章,但我还是把我的面包屑路径贴在了这里。我的代码spelunking是在Jackson 2.7.2中完成的。

Jackson Core comes with two concrete implementations of the JsonParser interface:

Jackson Core提供了JsonParser接口的两个具体实现:

  • ReaderBasedJsonParser, a parser for character streams (encoding independent)
  • ReaderBasedJsonParser,字符流解析器(独立编码)
  • UTF8StreamJsonParser, a parser optimized for UTF-8 byte streams
  • UTF8StreamJsonParser,为UTF-8字节流优化的解析器

The code in these two classes is quite redundant, likely out of necessity. Each class has a method which is called by nextToken() when an unexpected character is encountered. ReaderBasedJsonParser's is named _handleOddValue() and UTF8StreamJsonParser's is _handleUnexpectedValue(). Stuff like accepting "NaN" as a numeric value and allowing single-quoted string values happens in here.

这两个类中的代码是相当多余的,可能是出于需要。每个类都有一个方法,当遇到意外字符时由nextToken()调用。ReaderBasedJsonParser命名为_handleOddValue(),而UTF8StreamJsonParser命名为_handleUnexpectedValue()。像接受“NaN”作为数值并允许单引号字符串值这样的事情在这里发生。

My plan (before I came to my senses and realized an awful hack would suffice for my short-term needs) was to subclass one/both of these parsers and override the methods above to handle unquoted string values. Because this method is called when the input stream is in the context of a field value (just after recognizing the colon), it should be possible to read forward until a comma or right curly brace is encountered and count everything read up to that point as a string value. This code is tricky to write as it requires an understanding of Jackson's buffering strategy, the parser's architecture (the current pointer into the current buffer is an instance variable), etc.

我的计划(在我意识到并意识到一个可怕的hack将满足我的短期需求之前)是对一个/两个解析器进行子类化,并重写上面的方法来处理未引用的字符串值。因为当输入流位于字段值的上下文中(仅在识别冒号之后)时,应该可以调用此方法,直到遇到逗号或右花括号并将该点的所有内容作为字符串值进行计数。这段代码很难编写,因为它需要理解Jackson的缓冲策略、解析器的体系结构(当前的指针指向当前缓冲区是一个实例变量)等等。

To make an ObjectMapper use this custom parser, one must subclass JsonFactory and override the _createParser() method with one which instantiates it. More work might be required to make both the regular and UTF-8 parser work correctly, although it's sufficient to force the use of the regular parser if performance isn't a concern. Then, an instance of this custom JsonFactory may be passed into ObjectMapper's constructor.

要使ObjectMapper使用这个自定义语法分析器,必须要子类化JsonFactory并使用实例化它的_createParser()方法。可能需要进行更多的工作,以使常规和UTF-8解析器工作正常,但如果不关心性能,就足以强制使用常规解析器。然后,该自定义JsonFactory的一个实例可能被传递到ObjectMapper的构造函数中。

Hope this helps someone.

希望这可以帮助别人。