Dumps String with newlines to Multiline? #236

simplesteph · 2019-03-09T15:03:01Z

How can I achieve this?

    print(toml.dumps({"test": "foo\n\nbar"}))

Should be

test = """
foo

bar
"""

The text was updated successfully, but these errors were encountered:

simplesteph · 2019-03-09T16:31:34Z

I created my own wrapping encoder which is super dirty but works. I'm not going to do a PR because I'm not sure it fully respects the specs on the edge cases (especially if the enclosed string has triple quotes), but it works for my use case as I don't have triple quotes in my strings

import datetime
import re
import sys

from toml.decoder import InlineTableDict
from toml.encoder import _dump_str, _dump_float, _dump_time

if sys.version_info >= (3,):
    unicode = str

def _dump_str_new(v):
    multilines = v.split('\n')
    if len(multilines) > 1:
        return unicode('"""\n' + v.strip() + '\n"""')
    else:
        return _dump_str(v)

class MyTomlEncoder(object):

    def __init__(self, _dict=dict, preserve=False):
        self._dict = _dict
        self.preserve = preserve
        self.dump_funcs = {
            str: _dump_str_new,
            unicode: _dump_str_new,
            list: self.dump_list,
            bool: lambda v: unicode(v).lower(),
            int: lambda v: v,
            float: _dump_float,
            datetime.datetime: lambda v: v.isoformat().replace('+00:00', 'Z'),
            datetime.time: _dump_time,
            datetime.date: lambda v: v.isoformat()
        }

    def get_empty_table(self):
        return self._dict()

    def dump_list(self, v):
        retval = "["
        for u in v:
            retval += " " + unicode(self.dump_value(u)) + ","
        retval += "]"
        return retval

    def dump_inline_table(self, section):
        """Preserve inline table in its compact syntax instead of expanding
        into subsection.

        https://github.com/toml-lang/toml#user-content-inline-table
        """
        retval = ""
        if isinstance(section, dict):
            val_list = []
            for k, v in section.items():
                val = self.dump_inline_table(v)
                val_list.append(k + " = " + val)
            retval += "{ " + ", ".join(val_list) + " }\n"
            return retval
        else:
            return unicode(self.dump_value(section))

    def dump_value(self, v):
        # Lookup function corresponding to v's type
        dump_fn = self.dump_funcs.get(type(v))
        if dump_fn is None and hasattr(v, '__iter__'):
            dump_fn = self.dump_funcs[list]
        # Evaluate function (if it exists) else return v
        return dump_fn(v) if dump_fn is not None else self.dump_funcs[str](v)

    def dump_sections(self, o, sup):
        retstr = ""
        if sup != "" and sup[-1] != ".":
            sup += '.'
        retdict = self._dict()
        arraystr = ""
        for section in o:
            section = unicode(section)
            qsection = section
            if not re.match(r'^[A-Za-z0-9_-]+$', section):
                if '"' in section:
                    qsection = "'" + section + "'"
                else:
                    qsection = '"' + section + '"'
            if not isinstance(o[section], dict):
                arrayoftables = False
                if isinstance(o[section], list):
                    for a in o[section]:
                        if isinstance(a, dict):
                            arrayoftables = True
                if arrayoftables:
                    for a in o[section]:
                        arraytabstr = "\n"
                        arraystr += "[[" + sup + qsection + "]]\n"
                        s, d = self.dump_sections(a, sup + qsection)
                        if s:
                            if s[0] == "[":
                                arraytabstr += s
                            else:
                                arraystr += s
                        while d:
                            newd = self._dict()
                            for dsec in d:
                                s1, d1 = self.dump_sections(d[dsec], sup +
                                                            qsection + "." +
                                                            dsec)
                                if s1:
                                    arraytabstr += ("[" + sup + qsection +
                                                    "." + dsec + "]\n")
                                    arraytabstr += s1
                                for s1 in d1:
                                    newd[dsec + "." + s1] = d1[s1]
                            d = newd
                        arraystr += arraytabstr
                else:
                    if o[section] is not None:
                        retstr += (qsection + " = " +
                                   unicode(self.dump_value(o[section])) + '\n')
            elif self.preserve and isinstance(o[section], InlineTableDict):
                retstr += (qsection + " = " +
                           self.dump_inline_table(o[section]))
            else:
                retdict[qsection] = o[section]
        retstr += arraystr
        return (retstr, retdict)

and to use

            text = toml.dumps(data, MyTomlEncoder())

retorquere · 2019-05-09T15:11:55Z

When I use this, I get

  File "/usr/local/lib/python3.7/site-packages/toml/encoder.py", line 47, in dumps
    addtoretval, sections = encoder.dump_sections(o, "")
TypeError: dump_sections() missing 1 required positional argument: 'sup'

vvasuki · 2021-04-06T03:51:14Z

My minimal implementation https://github.com/sanskrit-coders/sanskrit_data/blob/67e0999be6f8bf7fff761f0484141e03b9e551f4/sanskrit_data/toml_helper.py#L6

EternityForest · 2021-05-13T05:39:59Z

@vvasuki could we add a replace('"""','\"""') as a minimal way to correctly handle escaping any embedded triple quotes?

vvasuki · 2021-05-13T07:13:23Z

@vvasuki could we add a replace('"""','"""') as a minimal way to correctly handle escaping any embedded triple quotes?

Done (edited link in prior message.)

simplesteph changed the title ~~Dumps to Multiline?~~ Dumps String with newlines to Multiline? Mar 9, 2019

comodoro mentioned this issue Apr 26, 2019

String dump with correct escaping #244

Open

Skylion007 mentioned this issue Oct 1, 2020

Max Line Length for Pretty-Format-ToML / Don't reformat multiline strings? macisamuele/language-formatters-pre-commit-hooks#39

Closed

pradyunsg added component: encoder type: feature syntax: strings labels Apr 20, 2022

ZeroRin mentioned this issue Apr 22, 2023

Dump Multiline string to toml file #421

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dumps String with newlines to Multiline? #236

Dumps String with newlines to Multiline? #236

simplesteph commented Mar 9, 2019

simplesteph commented Mar 9, 2019 •

edited

Loading

retorquere commented May 9, 2019

vvasuki commented Apr 6, 2021 •

edited

Loading

EternityForest commented May 13, 2021

vvasuki commented May 13, 2021

Dumps String with newlines to Multiline? #236

Dumps String with newlines to Multiline? #236

Comments

simplesteph commented Mar 9, 2019

simplesteph commented Mar 9, 2019 • edited Loading

retorquere commented May 9, 2019

vvasuki commented Apr 6, 2021 • edited Loading

EternityForest commented May 13, 2021

vvasuki commented May 13, 2021

simplesteph commented Mar 9, 2019 •

edited

Loading

vvasuki commented Apr 6, 2021 •

edited

Loading